The Welch Foundation
Celebrates Outstanding
Achievements in Chemistry


The Welch Award in Chemistry is presented annually by the Robert 
A. Welch Foundation to foster and encourage basic chemical research 
and to recognize the value of chemical research contributions for the 
benefit of humankind. The Welch Foundation proudly presents the 
2022 award to Dr. Carolyn R. Bertozzi. 


Dr. Bertozzi is a leading figure in the field of chemical biology who has made major strides
with her study of cell surface sugars. With this award, she is being recognized for her
innovative contributions to bioorganic chemistry and the development of bioorthogonal
chemistry, all of which have instrumental applications in the study of human health and
disease. Upon accepting the award, Dr. Bertozzi will receive $500,000 and a gold medallion.

CAROLYN R. BERTOZZI
Anne T. and Robert M. Bass
Professor of Chemistry,
Stanford University


THE WELCH FOUNDATION


Advancing Chemistry. Improving Life. 


Register now for the 2022 Welch Conference at welch1.org 
HOUSTON, TX | OCTOBER 24-25, 2022

Genomics, Bio-Rad, and YOU.


PCR. qPCR. ddPCR. 
Bio-Rad has the tools and support you need. 


Take Your Gene Expression Research to the Next Level 


At Bio-Rad, genomics innovation is in our DNA. Our technologies provide
the robust, reliable results you need. With the leading advancements in both
quantitative PCR (qPCR) and digital PCR (dPCR), you can unlock greater
accuracy and reproducibility with easy-to-use instruments. See what you can
discover with versatile nucleic acid quantification solutions from trusted experts.


Learn more about how Bio-Rad can support your genomics 
research at bio-rad.com/genomics-solutions

Advancing the future of scientific research


To help scientists accelerate scientific innovation, CAS, a division of the American 
Chemical Society, has developed the CAS SciFinder Discovery Platform™, a suite of 
powerful research tools. 


With tens of thousands of scientific journals and millions of peer-reviewed articles 
published annually, finding the right information gets more complicated every year. 
CAS solves that problem with the CAS SciFinder Discovery Platform. “It's basically
a search engine for scientists and for the different roles within a company,” says
Jian Wu, senior director of program management at CAS. “We have a suite of
products that all come together in a platform that helps scientists do their research
and development.” That assistance includes powerful methods for searching the
scientific literature and more.

With more than a century of experience in the information business, CAS makes 
the perfect partner for scientists seeking answers to questions. “We have over 100
years of knowledge in how to collect, aggregate, and organize information,” Wu
says. “We are a technology company that specializes in scientific information.”

Although the amount of scientific information grows rapidly, scientists don’t have 
the luxury of unlimited hours to explore it. So, CAS combines human curation from 
scientific experts with advanced technology, including artificial intelligence (AI), to
“aggregate information and disseminate that information in a way that's efficiently 
digestible for the users,” Wu explains. 

In particular, the CAS SciFinder Discovery Platform was developed, Wu says,
“to solve some very important pain points and problems that scientists face.” For
example, he explains that scientists often have questions that could already be
answered using existing scientific information, but that information is not always
easy to find. “We help by either pointing scientists to the right documents to read or
we just flat out give them the answer within the products,” Wu says.


To do this, the CAS SciFinder Discovery Platform provides three tools that can work 
together. In this way, it becomes a one-stop shop for scientists seeking information. 


A trio of tools

“The big gorilla, called CAS SciFinderⁿ, contains a Google-like search engine that
is specialized for scientists,” Wu says. Based on the CAS Content Collection™,
including articles, patent information, and more, scientists can use CAS SciFinderⁿ
to explore chemical names, structures, regulatory information, and other scientific
information. Life scientists also find many uses for CAS SciFinderⁿ. For example, it
includes more than 54 million references related to biologics.

When a scientist has questions about a chemical reaction, the CAS Analytical 
Methods™ solution can help. “This is built for analytical chemists,” Wu says. Here, 
chemists can explore detailed methods for millions of chemical procedures and get 
step-by-step instructions for replicating a reaction. Also, methods can be compared 
to determine differences in conditions, reagents, and instrumentation. Currently, the 
CAS SciFinder Discovery Platform includes almost 150 million reactions, almost 200 
million substances, and over 700 million genetic sequences that can be explored, 
and all of these numbers continue to increase. 

Formulation scientists looking for approaches to producing paint, a 
pharmaceutical pill, shampoo, or other products can use CAS Formulus® to explore 
a database of formulations and associated workflows. In this way, formulators 
can more quickly find methods that are safe and effective, all while maintaining 
regulatory compliance. 

In addition to all the power behind this trio of tools, the CAS SciFinder 
Discovery Platform includes an intuitive user interface. It's so easy to use that 
Wu says, “We've seen people comfortably using it within 30 minutes.” That ease 
of use comes from the hard work CAS has done to develop this solution. “We are
scientists making tools for other scientists,” Wu says. “So we understand our
users, talk to them all the time, and bounce ideas off each other.” What Wu calls
“heavy interaction with the end users” forms the foundation of this easy-to-use but 
designed-for-purpose platform. 


Making new medicines 

When a biotechnology or pharmaceutical company wants to make a new treatment, 
the CAS SciFinder Discovery Platform can simplify many steps. As an example, Wu 
discusses how it can be used in drug discovery. “When a company is trying to figure 
out how to create a compound—a chemical compound that's part of the drug—they 
need to do a lot of research on the different ways that have been tried already, 
including what has worked and what has not worked,” he says. “So that's one very
predominant use case.”

Companies can even use the platform to map out the entire drug development 
process. “Our aspiration is to serve R&D with all of the information for the entire 
drug-discovery workflow all the way up to clinical trials,” Wu says. “We're also 
investing heavily in building our predictive capabilities.” As an example, CAS uses 
technology that predicts what questions companies might ask. Wu says that this 
includes “asking about the different effects of a compound on a target molecule, and 
what effect that target molecule has on a disease—based on existing publications, 
and accurate predictions of the future.” 

Those predictions come from expert knowledge plus computation. “There's AI
and machine learning, but what fuels that is the data, and our data are expertly
curated,” Wu says. “So I'd say that’s our secret sauce.”


Produced by the Science/AAAS Custom Publishing Office 


In addition, companies can apply the CAS SciFinder Discovery Platform to 
many other aspects of drug development. For instance, any intellectual property 
issues of a compound can be explored. “You can also source the materials to make 
something, because we have capabilities that can link you to chemical suppliers,” 
Wu says. “One thing that is really cool is our tool that does retrosynthesis, which 
offers alternative paths for how to make something, and it provides the different 
reaction conditions, the byproducts, and so on.” 


Making science more efficient 

By using the CAS SciFinder Discovery Platform, scientists can even improve
the sustainability of their work. For example, a scientist might be planning an
experiment but find through a search that it has already been done. “Maybe it didn’t
work out,” Wu says. “All of our tools really highlight what's been experimented with
before, and avoiding repeats of failed experiments saves time and materials and
protects the environment, which is really crucial.” CAS will enhance contributions
to sustainability even more in the future. “We have plans to implement green
sustainable features,” Wu adds.

Beyond adding green chemistry, CAS is expanding the CAS SciFinder Discovery 
Platform to benefit all of the life sciences. “We're working on addressing information
in biology and medicinal chemistry, things beyond core chemistry,” Wu says.
“That's going to expand over the next 5 to 10 years, and after that, we'll work on
information in the material science of polymers.”

To summarize the value of the CAS SciFinder Discovery Platform, Wu says, “We
get you all the information you need and minimize the time you need to spend
before you go to the bench.” That generates efficiency in time and finances.
Even more important, that information helps generate better outcomes
from experiments and even from new drug manufacturing.

It’s time to think


Gaining a better understanding of infectious diseases,
including their characterization, evolution and transmission,
continues to be a priority, both from an R&D standpoint
and as a public health issue. The COVID-19 pandemic has
demonstrated the need for a wide range of tools to research
infectious diseases, and has highlighted the importance
of speed and the ability to pivot as new problems arise.
This has emphasized the need for innovation and thinking
differently about where to access those critical materials,
including genomics reagents.


Many scientists know NEB as a trusted reagent supplier to 
the life science community, but what you may not know is 
that we also offer a portfolio of products that can be used 
in infectious disease research, development of diagnostics 
and therapies, and in epidemiological studies and disease 
surveillance. In fact, many of our products have supported 
the development of COVID-19 diagnostics and vaccines, 
and can also be utilized with other infectious diseases, such 
as influenza and malaria. 


“GMP-grade” is a branding term NEB uses to describe reagents manufactured at our Rowley, MA facility, where we utilize procedures and process controls to 
manufacture reagents in compliance with ISO 9001 and ISO 13485 quality management system standards. NEB does not manufacture or sell products known 
as Active Pharmaceutical Ingredients (APIs), nor do we manufacture products in compliance with all of the Current Good Manufacturing Practice regulations. 


One or more of these products are covered by patents, trademarks and/or copyrights owned or controlled by New England Biolabs, Inc. For more information,
please email us at busdev@neb.com. The use of these products may require you to obtain additional third party intellectual property rights for certain applications.

© Copyright 2022, New England Biolabs, Inc.; all rights reserved.


differently. 


Find out how NEB can support your infectious disease research and development. 


Benefit from almost 50 years of experience in 
molecular biology & enzymology 


Partner with our OEM & Customized Solutions 
team to find the best solution for your needs 


Take advantage of our expanded manufacturing 
capabilities 


Access product formats, such as GMP-grade*,
lyophilized, lyo-ready and glycerol-free 


Be confident in your product performance with 
our expanded quality and regulatory systems 


Ready to get started? Learn more at 
www.neb.com/InfectiousDiseases

Japan's Super-Kamiokande neutrino detector will soon be surpassed by Hyper-K.

1474 Room-temperature
superconductivity claim retracted

After doubts grew, blockbuster Nature 
paper is withdrawn over objections of study 
team By E. Hand

1475 University pandemic
policies raise equity worries
Tenure delays and pandemic 
impact statements could backfire, 
some fear By K. Langin 


1477 Signs of state
meddling seen in Russian
academy election

Leader of Russia's largest 
chipmaker elected president after 
incumbent's sudden withdrawal 
By O. Dobrovidova 


1478 Fraud charges crumble in China 
Initiative cases 

Judges reject claims that defendants 
defrauded agencies by not disclosing 
China ties By J. Mervis


1479 New Omicron strains may portend 
big COVID-19 waves 

Emerging subvariants are more immune 
evasive than ever By G. Vogel

The United States and Japan are embarking
on ambitious efforts to wring a key
phantoms known as neutrinos By A. Cho


INSIGHTS

gas flaring
Emissions from flaring threaten the global

1489 The genetics of a long life
Genetically diverse mice and cross-species
comparison uncover links to longevity

Earth's composition was shaped by collisions

and diversity analysis in research
Funding agencies have ample room
to improve their policies By L. Hunt et al.

A once-ridiculed sculpture display is worth
a second look By I. Nieuwland


1497 Spatial thinkers receive their due
An insider's guide to visual reasoning
celebrates those who think in pictures and
patterns By J. Wai

Presented by BII & Science


BII & Science Prize for Innovation


Bringing research
to life, and science
to market


Behind every life-changing solution is an
entrepreneurial scientist, a creative mind
who proved an idea in the lab and dared
to carry it out in the world.


To encourage more scientists to translate their
research, BioInnovation Institute (BII) and
Science present a new annual award.


Our three winners will have their essays published
in Science magazine and will be invited into
BII’s entrepreneurial ecosystem. In addition,
the Grand Prize winner will receive a prize
of USD 25,000, and each runner-up
will receive USD 10,000 at a grand award
ceremony in Copenhagen, Denmark.


The call for applications has just opened. 
Apply before November 1, 2022. 
www.bii.dk/scienceprize 


See film from the 
grand award 
ceremony 2022 


Two samples of rock (top and bottom holes)
were collected by the Perseverance rover from
this outcrop of the Séitah geologic formation in
Jezero crater, Mars. Also visible are a discarded
sample attempt (middle hole), a rock abrasion
patch (lower left depression), and the rover's
tracks and shadow.
Analysis of the Séitah 
formation shows that it 
consists of igneous rocks 
modified by liquid water. 
See pages 1512 and 1513. 
Image: NASA/JPL-Caltech

Building the Future,
One City at a Time


Our WINLAB is taking tomorrow's tech out of the lab
and into the streets of New York City


Building on their extensive research with wireless testbeds, radio technology,
and mobile internet architecture, the Rutgers team at the Wireless Information
Network Laboratory (WINLAB) is helping to implement COSMOS, a real-world
outdoor lab putting next-generation wireless technology to work.

A major partnership between WINLAB and other local innovation leaders,
with funding from the National Science Foundation, COSMOS will be
deployed across a square mile in Manhattan's vibrant, densely populated
West Harlem neighborhood.

These urban wireless networks of the future will support data-intensive
applications for a smarter, safer world.


Smart intersections that [illegible] information in real time

Cloud-based autonomous [illegible] driverless transportation

Remote surgery enabling doctors to operate on patients across great distances

Augmented and virtual reality to enhance interactivity and provide new experiences


Excellence is Earned

EDITORIAL


Better data, less gun violence 


For 25 years, the US government funded little research on firearm violence
prevention. Although some dedicated researchers made important discoveries
over this period, the scale of the research effort was not commensurate with
the problem. Recently, however, there has been an unprecedented surge in
research funding: the National Collaborative on Gun Violence Research, a
private philanthropy, has awarded more than $21 million since 2018; the
federal government has committed $25 million per year since 2019; and some
states and other philanthropies have recently invested in such research programs.

The new opportunities this funding creates to understand and prevent firearm
violence come at a critical moment. Firearm violence began increasing in 2015,
with a sharp rise between 2019 and 2020, and again in 2021, resulting in firearm
homicide rates not seen in the US since the end of the crack cocaine epidemic of
the early 1990s. Although these increases occurred for nearly every demographic
and geographic subgroup in the nation, they are highly concentrated among people
who are Black, a group already facing firearm homicide rates 10 times greater
than those of white people. Similarly, firearm suicides and suicides generally
have soared in recent years, with preliminary 2021 counts showing a firearm
suicide rate in the US greater than at any point in the past 43 years.

High and rising costs of firearm violence prompt an urgent need to use new
research funding to test and evaluate violence prevention strategies, many of
which (including community-based violence prevention interventions, state
firearm laws, policing and criminal justice strategies, and health care
interventions) have yielded promising initial results. However, mixed results
across outcomes and contexts make it challenging to justify and sustain
large-scale public investments in them.

Effective firearm violence prevention strategies require better understanding
of the people, places, conditions, and antecedents of firearm violence, a line
of research hampered by substandard federal data collection and laws that
prohibit the federal government from sharing data with researchers. Given
federal investments in other data systems that produce information on risk
behaviors such as seat belt use and fruit juice consumption, it is notable that
the US has had no federal system for estimating firearm ownership at the state
level for more than 15 years. Furthermore, although the federal government has
strong systems for counting the number of people killed by firearms, it fails at
measuring nonfatal firearm injuries and crimes, which constitute the majority of
firearm violence. There is no system that tracks hospitalizations for firearm
injuries in every state and over time. The federal government does not produce
estimates of all police shootings, mass shootings, or intimate partner violence.
And although the Federal Bureau of Investigation (FBI) maintained a data series
for 90 years that provided state information on use of firearms in violent
crimes, it allowed that system to lapse in 2021 in expectation that enough
police agencies would join the new (improved) National Incident-Based Reporting
System. They have not. At last report, only 11,490 of 18,933 agencies
contributed data through the new system, and the FBI has warned that it will be
unable to produce 2021 crime rate estimates or crime trend information for all
states. Without high-quality measures of nonfatal firearm injuries and crimes,
the prevalence and costs of interpersonal firearm violence are underreported,
and research efforts to understand relationships between firearm interventions
and key outcomes are undermined.

“Research on gun violence that is supported by new funding is overdue...”

In some cases, federal law needlessly prevents research on government data. For
instance, in the late 1990s and early 2000s, a productive line of research used
crime gun trace data collected by the Bureau of Alcohol, Tobacco, Firearms, and
Explosives. These data revealed states and businesses where guns used in crimes
originated, how crime guns moved across borders, the types of guns preferred for
crime, and how policies affected crime gun flows. However, amendments to the
Justice Department’s appropriations bills (the Tiahrt Amendments) now prohibit
the Bureau from sharing trace data with researchers. Other provisions of these
amendments prohibit the federal government from retaining records verifying
whether individuals passed federal background checks that are required to
purchase a firearm.

If the United States is serious about fixing the escalating problem of firearm
violence, the government needs to measure it. Research on gun violence that is
supported by new funding is overdue but will be handicapped until federal and
state firearm violence data systems improve.

—Andrew R. Morral and Rosanna Smart 
European satellite duo will study oceans and warming


The European Space Agency (ESA) last week approved the $420 million Harmony
mission as the next in its Earth Explorer line of science missions, following
a competition. Harmony’s two satellites will carry infrared sensors and radar
receivers to observe the turbulent waves, winds, and eddies that govern the
interchange of heat and gases between the oceans and atmosphere. Scientists
know oceans soak up more than 90% of the excess heat of global warming, but
they need Harmony’s finer scale observations to explain how, and to calibrate
climate models that predict the evolution of these dynamics decades ahead.
“We want to study how the oceans and the atmosphere are talking to each other,”
says principal investigator Paco Lopez-Dekker, a remote sensing scientist at
the Delft University of Technology. After launch in 2029, the Harmony satellites
will fly in formation with one of ESA’s Sentinel-1 radar satellites, to be
launched in 2025, while capturing its reflected beams at different angles. The
mission will also be able to monitor 3D ground motions as small as 1 millimeter
a year, of interest to scientists monitoring glacial ice loss, volcanic
eruptions, and the seismic strain that leads to earthquakes.


Fiona ruins Puerto Rico research


FACILITIES | Scientists in Puerto Rico say they lost vital refrigerated samples
and face delayed and ruined studies after Hurricane Fiona knocked out power
across the island last week. The disruption continued this week, and authorities
predict it may take workers weeks to restore a sustained power supply. Some
researchers have transferred samples to other locations powered by generators.
Ileana Rodriguez-Velez, a chemist at the University of Puerto Rico, Humacao,
moved some still-viable samples of plants she is studying for their anticancer
and antimicrobial activity to refrigerators at her and her mother’s houses.
Many researchers also endured outages when Hurricane Maria devastated the
island in 2017. The restoration of the power grid has proceeded slowly, and
after Fiona struck, some generators installed by researchers failed or lack
enough fuel to operate reliably.


Donor absorbs pandemic institute


Eighteen months after vaccine specialist Rick Bright took the helm of a new
institute with ambitious plans to help thwart future pandemics, he has left
the job, and the young organization has been folded into the larger mission of
its funder, the Rockefeller Foundation. During the administration of former
President Donald Trump, Bright led a major U.S. funding agency that supports
R&D for products to combat pandemics, but resigned from that job because of
disagreements over the administration’s COVID-19 response. Questioned about
Bright’s departure from the Pandemic Prevention Institute, a foundation
spokesperson says, “The landscape has changed significantly” since its launch,
“as there are now many different initiatives focused on creating a
pandemic-free future.” These include ones led by the World Health Organization
and the U.S. Centers for Disease Control and Prevention. Rockefeller had
pledged to invest $150 million over 3 years to support outside scientific
efforts that do pathogen surveillance and establish a network to share data
more efficiently. Bright says Rockefeller was pivoting to focus more on climate
change. “I know their leadership and support in this area will make a huge
impact,” he says.


Global disease fund nets top haul 


FUNDING | High- and low-income countries alike last week joined in pledging
$14.25 billion to the Global Fund to Fight AIDS, Tuberculosis and Malaria, in
the largest single fundraising push for global health. Supporters said the sum
was necessary to help reverse setbacks in combating those diseases caused by
the COVID-19 pandemic, although the amount fell short of the Global Fund’s goal
of $18 billion. The pledged amount may increase if Italy and the United Kingdom
make contributions; changes this month in their governments delayed expected
commitments. Many of the 45 countries that did pledge money, including
low-income nations in Africa, upped their pledges by 30% or more despite
inflation and other economic pressures. If achieved, the $18 billion could save
20 million lives and avoid 450 million new infections, the Global Fund
estimates. The United States made the single largest commitment, up to
$6 billion, but Congress must approve the money. The Bill & Melinda Gates
Foundation made the largest private pledge, $912 million.


‘Honorary’ authors abound 


PUBLISHING | Up to one-third of authors named on a large sample of scientific
journal articles are “honorary” because they did not fulfill criteria for being
listed as authors, a study has found. Some 1% of the authors provided only
funding or other resources for the project, contributions that would not
qualify a researcher for authorship under two standard definitions. The study,
presented this month at the International Congress on Peer Review and
Scientific Publication, examined more than 629,000 authors on 82,000 papers
published in the PLOS family of journals from 2017 to 2021. It was unusual
because it examined standardized statements, submitted by corresponding
authors, that described each author’s contribution. Previous studies have
estimated the frequency of honorary authorship based on self-reports by
scientists on surveys.

IN FOCUS NASA's James Webb Space Telescope has captured the clearest view of
Neptune and its rings since the Voyager 2 probe briefly flew by in 1989. The new images, 
released last week, are the first recorded in infrared light—which explains why the 
planet does not appear blue, as it does in visible light. The gas giant’s high-altitude, 
methane-ice clouds appear as streaks and spots, NASA said. The image also shows 
seven of Neptune's 14 known moons (not all of which are visible in the version above). 


U.S. SCIENCE ADVISER The U.S. Senate voted 56-40 to confirm Arati Prabhakar to
lead the White House Office of Science and Technology Policy. Prabhakar, an
applied physicist and former venture capitalist, will succeed Eric Lander, who
resigned as the office’s head in February after admitting he created a hostile
work environment. She will also be the president's new science adviser,
replacing Francis Collins, who had been acting in that role since Lander’s
departure. Prabhakar led the Defense Advanced Research Projects Agency from
2013 to 2017.


EBOLA OUTBREAK Uganda is battling an outbreak of the Sudan ebolavirus, one of
six members of the genus Ebolavirus. The country declared an outbreak on
20 September and has since reported at least 36 confirmed or suspected cases,
including 23 deaths. Two licensed Ebola vaccines exist, but neither one has
been approved for use against the Sudan ebolavirus. Meanwhile, the Democratic
Republic of the Congo declared on 27 September that its latest Ebola outbreak
(a single infection with the Zaire ebolavirus confirmed on 22 August) has ended.


U.K. FRACKING The U.K. government lifted a 2019 moratorium on hydraulic
fracturing (“fracking”) to extract natural gas in England. A report last week
by the British Geological Survey concluded that forecasting the small
earthquakes triggered by fracking remains a challenge; it recommended drilling
more test wells to obtain data.


GREENHOUSE GAS TREATY The U.S. Senate ratified the 2016 Kigali Amendment,
joining 137 other countries in an effort to slash the production and use of
hydrofluorocarbons, a class of potent greenhouse gases. Congress previously
mandated reductions, and U.S. companies have begun to switch to alternatives.
But the vote had symbolic importance as the first international climate treaty
ratified by U.S. lawmakers since 1992.

Room-temperature superconductor claim retracted


A tiny diamond vise squeezed a hydrogen-bearing sample into
a room-temperature superconductor, researchers claimed.


After doubts grew, blockbuster Nature paper is withdrawn over objections of study team 


By Eric Hand 


In 2020, Ranga Dias, a physicist at the University of Rochester, and his
colleagues published a sensational result in Nature, featured on its cover.
They claimed to have discovered a room-temperature superconductor: a material
in which electric current flows frictionlessly without any need for special
cooling systems. Although it was just a speck of carbon, sulfur, and hydrogen
forged under extreme pressures, the hope was that someday the material would
lead to variants that would enable lossless electricity grids and inexpensive
magnets for MRI machines, maglev railways, atom smashers, and fusion reactors.

Faith in the result is now evaporating. 
On Monday Nature retracted the study, cit- 
ing data issues other scientists have raised 
over the past 2 years that have undermined 
confidence in one of two key signs of super- 
conductivity Dias’s team had claimed. 
“There have been a lot of questions about 
this result for a while,” says James Hamlin, 
an experimental condensed matter physi- 
cist at the University of Florida. But Jorge 
Hirsch, a theoretical physicist at the Uni- 
versity of California, San Diego (UCSD), 
and longtime critic of the study, says the 
retraction does not go far enough. He believes
it glosses over what he says is evidence of
scientific misconduct. “I think this is a real
problem,” he says. “You cannot leave it as,
‘Oh, it’s a difference of opinion.’”

The retraction was unusual in that Na- 
ture editors took the step over the objec- 
tion of all nine authors of the paper. “We 
stand by our work, and it’s been verified 
experimentally and theoretically,” Dias
says. Ashkan Salamat, a physicist at the 
University of Nevada, Las Vegas, and an- 
other senior member of the collaboration, 
points out the retraction does not question 
the drop in electric resistance—the most 
important part of any superconductivity 
claim. He adds, “We’re confused and dis- 
appointed in the decision-making by the 
Nature editorial board.” 

The retraction comes even as excitement 
builds for the class of superconducting ma- 
terials called hydrides, which includes the 
carbonaceous sulfur hydride (CSH) devel- 
oped by Dias’s team. Under pressures greater 
than at the center of the Earth, hydrogen 
is thought to behave like a superconduct- 
ing metal. Adding other elements to the 
hydrogen—creating a hydride structure—can 
increase the “chemical pressure,” reducing
the need for external pressure and making 
superconductivity reachable in small laboratory
vises called diamond anvil cells. As Lilia
Boeri, a theoretical physicist at the Sapienza 
University of Rome, puts it, “These hydrides 
are a sort of realization of metallic hydrogen 
at slightly lower pressure.” 

In 2015, Mikhail Eremets, an experimen- 
tal physicist at the Max Planck Institute 
for Chemistry, and colleagues reported the 
first superconducting hydride: a mix of hy- 
drogen and sulfur that, under enormous 
pressures, exhibited a sharp drop in elec- 
trical resistance at a critical temperature 
(Tc) of 203 K (−70°C). That was nowhere
near room temperature, but warmer than 
the Tc for most superconducting materials. 
Some theorists thought adding a third ele- 
ment to the mix would give researchers a 
new variable to play with, enabling them 
to get closer to ambient pressures—or 
room temperatures. For the 2020 Nature 
paper, Dias and colleagues added carbon, 
crushed the mix in a diamond anvil cell, 
and heated it with a laser to create a new 
substance. They reported that tests showed 
the telltale resistance drop at a Tc of 288 K 
(15°C)—roughly room temperature—and a 
pressure of 267 gigapascals, about 75% of 
the pressure at the center of the Earth. 

But in a field that has seen many super- 
conducting claims come and go, a drop 
in resistance alone is not considered sufficient.
The gold standard is to also provide
evidence of another key attribute of
superconductors: their ability to expel an 
applied magnetic field when they cross Tc 
and become superconducting. Measuring 
that effect in a diamond anvil cell is im- 
practical, so experimentalists working with 
hydrides often measure a related quantity 
called “magnetic susceptibility.” Even then 
they must contend with tiny wires and 
samples, immense pressures, and a back- 
ground magnetic signal from metallic gas- 
kets and other experimental components. 
“It’s like you're trying to see a star when 
the Sun is out,” Hamlin says. 

The study’s magnetic susceptibility data 
were what led to the retraction. The team 
members reported that a susceptibility
signal emerged after they had subtracted 
a background signal, but they did not in- 
clude raw data. The omission frustrated 
critics, who also complained that the team 
relied on a “user-defined” 
background—an assumed 
background rather than a 
measured one. But Salamat 
says relying on a user-defined
background is customary in 
high-pressure physics because 
the background is so hard to 
measure experimentally. 

In response to some of the 
criticisms, Dias and Salamat 
in 2021 posted a paper to the arXiv physics 
preprint server. It contained raw suscepti- 
bility data and purported to explain how 
the background was subtracted. “It raised 
more questions than it answered,” says 
Brad Ramshaw, a quantum materials phys- 
icist at Cornell University. “The process of 
going from the raw data to the published 
data was incredibly opaque.” 

Hirsch, a firebrand who has criticized 
other hydride superconductivity claims, 
has made stronger accusations. He says 
some of the published data presented by 
Dias and Salamat could be represented by 
a smooth polynomial curve—impossible 
for noisy laboratory measurements. “I 
think they were fabricated,” Hirsch says. 
He also noted suspicious similarities to 
data in a 2009 Physical Review Letters 
paper on superconductivity in europium 
under high pressures. That study, which 
shared one author with the Nature pa- 
per, was retracted last year because of in- 
accurate magnetic susceptibility data. 
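
The reasoning behind the smooth-curve objection can be illustrated with a toy example. The sketch below is not Hirsch's analysis and uses no data from the study; the cubic curve, the noise level, and the uniform sampling grid are all assumptions chosen for illustration. It relies on a standard property of finite differences: on evenly spaced samples, the fourth difference of a cubic polynomial is zero, whereas random measurement noise survives the differencing and leaves a clear residual.

```python
import random


def finite_difference(values, order):
    """Apply the forward-difference operator `order` times."""
    for _ in range(order):
        values = [b - a for a, b in zip(values, values[1:])]
    return values


def max_abs(values):
    """Largest absolute value in a sequence."""
    return max(abs(v) for v in values)


# Hypothetical smooth curve: a cubic sampled on a uniform grid (assumption).
xs = [i * 0.1 for i in range(50)]
smooth = [0.5 * x**3 - 2.0 * x**2 + x + 3.0 for x in xs]

# The same curve with simulated Gaussian noise; the noise size is assumed.
rng = random.Random(0)
noise_sigma = 0.05
noisy = [y + rng.gauss(0.0, noise_sigma) for y in smooth]

# Fourth differences: essentially zero for the pure cubic (only rounding
# error remains), but clearly nonzero once noise is present.
smooth_residual = max_abs(finite_difference(smooth, 4))
noisy_residual = max_abs(finite_difference(noisy, 4))

print(f"pure polynomial residual: {smooth_residual:.2e}")
print(f"noisy data residual:      {noisy_residual:.2e}")
```

In this toy setting, data that track a polynomial down to rounding error behave like the first series, not the second, which is the general intuition behind treating an unusually smooth curve as a warning sign.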

In preprints, Hirsch kept hammering on 
the Dias study—so forcefully that in Febru- 
ary, arXiv temporarily banned him from 
posting. He also complained to the Univer- 
sity of Rochester, which in two inquiries 
found no evidence of scientific misconduct. 
This month, Hirsch and another critic, Dirk 
van der Marel, a condensed matter physicist 
at the University of Geneva, published their 
conclusion in a physics journal that the sus- 
ceptibility data in the Dias study are “patho- 
logical.” Van der Marel is heartened by the 
Nature retraction. “It is good to know you 
are not alone in believing something is seri- 
ously wrong,” he says. 

Dias says the team plans to resubmit the 
paper to Nature without any background 
subtraction; he says the raw data alone 
show the change in magnetic susceptibility. 
Salamat also notes that Hirsch and Van der 
Marel are not high-pressure experimental- 
ists. “We believe that some of their actions 
have veered into personal attacks,” he says. 
“We're just not going to have people throw 
mud at us from a distance.” Dias sent “cease 
and desist” letters to Van der Marel and to 
Hirsch’s department chair and dean at UCSD. 

Eremets says the Dias study might still be 
right about CSH. But he has tried at least six 
times to replicate the results 
and failed. Although Dias’s 
team has shared the basics 
of its experimental protocol, 
Eremets says they have been 
less forthcoming in the details, 
such as what type of carbon 
they used in their CSH mix. 
Boeri agrees. “There are a lot 
of people who are a lot more 
careful, and they share the 
data, and they share the samples,” she says.

Salamat says colleagues are welcome to 
come to their labs and observe their meth- 
ods and protocols. “We have an open-door 
policy.” And he points to a CSH replication 
published in July. Critics question its inde- 
pendence, however, because it was led by 
Salamat’s group and includes many of the 
same authors as the Nature paper. 

Dias and Salamat are not slowing down. 
The duo has co-founded a company, Un- 
earthly Materials, to pursue commercial 
room-temperature superconductors. At 
conferences this summer, Dias has pre- 
sented claims of superconductivity in new 
hydride compounds. Although he declined 
to comment on those claims until they 
are published, he says, “We’ve moved on 
from the 2020 work.” Salamat adds, “We’re 
on the precipice of a new era of high- 
temperature superconductivity.” 

Eremets is skeptical that Dias’s new 
superconductors will stand up to scru- 
tiny. “How is this possible? Everything he 
touches turns to gold.” But he is confident 
that the patient work of science, under- 
pinned by painstaking replication, will sort 
the real promise of hydrides from the ques- 
tionable claims. “Science is not afraid of 
these things,” he says. “The truth, sooner or 
later, will come.” 
University pandemic policies raise equity worries


Tenure delays and pandemic 
impact statements could 
backfire, some fear 


By Katie Langin 


As the COVID-19 pandemic swept
across the globe in 2020, calls began 
to ring out for universities to swiftly 
address concerns that interruptions 
to research, closure of schools and 
day cares, and other disruptions 
could widen existing inequities in aca- 
demia and make it harder for women and 
researchers from other underrepresented 
groups to stay afloat. Many universities in 
the United States and elsewhere went on 
to institute new policies to support early- 
career faculty, including delaying tenure 
decisions and giving applicants for ten- 
ure and promotion a chance to disclose 
how COVID-19 disruptions had impeded 
their work. But despite the good inten- 
tions, many worry these moves fall short of 
what is needed—and if not properly imple- 
mented, they could end up infusing more 
bias into tenure and promotion decisions. 
“These are complex problems in which 
a single pandemic impact statement or a 
tenure delay is not really going to address 
everything,” says Dawn Culpepper, associ- 
ate director of the ADVANCE Program for 
Inclusive Excellence at the University of 
Maryland, College Park. “We need more 
creative thinking on these issues.” 
Culpepper recently completed an analy- 
sis of policies for pandemic impact state- 
ments at 65 research-intensive universities 
in the United States and Canada. More 
than half didn’t have publicly available 
campus-wide directives. Of the 27 that 
did, policies varied widely. At some uni- 
versities, faculty were encouraged to write 
only single-paragraph statements; at oth- 
ers they were allowed to submit up to five 
pages. One university required all faculty 
going up for tenure or promotion to sub- 
mit a statement; the others simply offered 
the option. Only 59% of institutions provided
guidelines for reviewers about how
to interpret the statements, and most of 
those that did only gave vague instruc- 
tions. “I’m worried ... that reviewers are 
making judgment calls about what kind 
of COVID impact statements are valid and 
legitimate and which ones aren’t,” says 
Culpepper, whose analysis will be pub- 
lished in a book chapter in 2023. 

In all cases, faculty were instructed to focus
their statements on professional disruptions,
such as lab closures, travel restrictions,
and increased teaching and advising workloads.
Twenty-two percent of institutions prohibited
faculty from mentioning personal circumstances
such as increased caregiving responsibilities,
but many others left the door open to such
disclosures—a significant departure from
standard tenure and promotion procedures.
Normally, says Jessi Smith, a vice provost at
the University of Colorado (CU), Colorado
Springs, “The promotion and tenure process is
really devoid of the human that is behind the
package.”

The decision to allow personal disclosures is
well-intentioned, many argue, and could be an
important step toward helping parents—the group
that experienced the largest drop in
productivity during the pandemic. But because
stigma against motherhood lingers in academia,
some worry that such disclosures could backfire.
“In a healthy institution, where care is not
seen as a negative thing ... it should be easy
enough for people to disclose,” says Joya Misra,
a professor of sociology and public policy and
director of the ADVANCE program at the
University of Massachusetts, Amherst. “But as
long as people are at institutions where it is
stigmatized, asking people to write about it
could actually damage their long-term career
prospects.”

For some faculty members, it may also be risky
to open up about any challenges, personal or
professional. Black women, for example, “are
expected to just do and be and show up no matter
what’s going on,” says Christa Porter, an
associate professor of higher education
administration at Kent State University who has
studied the experiences of Black women in
academia. Their pandemic impact statements may
not be viewed in the same light as similar
statements written by, say, white men. Porter,
who is also associate dean of the graduate
college, took that concern into account during
her own recent merit review. As a Black woman,
she elected to not mention having children at
home during the pandemic, even though some of
her colleagues did so. “I just know it’s going
to be seen differently,” she says.

Misra has been leading workshops on 
her campus and elsewhere to train faculty 
to review pandemic impact statements eq- 
uitably. She and her fellow trainers present 
case studies involving fictional professors 
and ask attendees to discuss how they 
would handle each case. “The idea is that 
if you think through enough cases before 
you start talking about colleagues in your 
department, you have a set of principles 
in mind,” she says. That way, it’s not about 
the person—it’s about the situations they 
experienced. “I truly believe that every in- 
stitution that has pandemic impact state- 
ments needs training on how to use them.” 
Researchers are also concerned that
inequities could arise from tenure delay 
policies, which went into place at more 
than 97% of research-intensive universi- 
ties in the United States, according to a 
study by Culpepper and colleagues in press 
in the ADVANCE Journal. Proponents of 
these policies “really firmly believe that 
we just need to keep giving tenure delays, 
because people need to catch up,” says a 
faculty member who requested anonymity 
to speak candidly about discussions with 
colleagues. They are “unable to pop out of 
this idea of how much people should have 
accomplished in a given time.” 

But others are pushing for a different 
approach: adjusting productivity expecta- 
tions for tenure decisions. “The boat was 
not the same for everybody in the COVID 
storm,” says Margaret Ptacek, a professor 
of biology and past director of the AD- 
VANCE program at Clemson University. 
“We really need to look at how tenure and 
promotion policy and practices occur and 
make sure that there’s a pathway for all dif- 
ferent types of identities of faculty to have 
the potential for success.” Critics also point 
to a 2018 study of tenure clock stoppage due 
to parental leave among economics profes- 
sors, which found that when such policies 
were implemented from 1980 to 2005, they 
boosted tenure rates for men but reduced 
them for women—likely because some men 
continued to work during their leave. 

“Continuing to wait until [a faculty mem- 
ber] ‘catches up’ just widens the power, 
access, and pay gaps that come with ten- 
ure and promotion,” says Smith, whose re- 
search found that women and members of 
underrepresented racial and ethnic groups 
at her campus and CU Boulder were the 
most likely to accept tenure delays during 
the pandemic. “If we are going to do that,” 
she continues, “then I would hope to see 
some sort of retroactive pay that gives those 
people the raise they would have received if 
they had gone up on time.” 

As things “return to normal,” some worry 
that conversations about accounting for 
the pandemic’s effects on academia, from 
current faculty to those still in training, 
are petering out prematurely. “We are still 
going to feel these impacts for a number of 
years, and in fact we probably haven’t even 
seen them really play out. So there’s even 
more reason to start paying attention to this 
and do so in a proactive way,” Culpepper
says. Going forward, “These issues are just 
going to get stickier and stickier.” 

“Universities did recognize that some- 
thing needed to be done,” Misra says. “But 
I don’t see as much progress as I had hoped 
to see. I really hoped that it would lead to 
more of a transformation.”
Signs of state meddling seen in
Russian academy election 


Leader of Russia's largest chipmaker elected president 
after incumbent’s sudden withdrawal 


By Olga Dobrovidova 


The Kremlin tightened its control over
the 300-year-old Russian Academy
of Sciences last week when its current
president, Alexander Sergeev,
withdrew his bid for a second term a
day before the election, citing the
“administrative pressures” many RAS members
face for “speaking out.” The 67-year-old laser
physicist, who was widely expected to win, 
declined to explain his withdrawal in more 
detail to members attending the RAS annual 
meeting in Moscow, but said it was a “forced 
decision.” Instead, Gennady Krasnikov, who 
leads Mikron, Russia’s biggest chipmaker, 
won the 20 September election. 

The disruption shows how state meddling 
is weakening the once-powerful RAS, already 
hobbled under 2 decades of rule by Presi- 
dent Vladimir Putin, says Michael Gordin, 
who studies the history of Russian science at 
Princeton University. Keeping candidates off 
the ballot through coercion is “part of a menu 
of actions that the Russian state takes to
ensure favorable election outcomes,” agrees
Robert Moser, who studies Russian politics at 
the University of Texas, Austin. 

Some outspoken RAS members tried to 
organize an election boycott on short notice. 
But a narrow quorum was present and willing
to go ahead with a secret ballot for the
two remaining candidates: Krasnikov and 
Dmitriy Markovich, who leads a thermo- 
physics research institute in Siberia and is 
not well known outside the region. Krasnikov 
won by a comfortable 871-397 vote. 

In the last RAS election, in 2017, Vladimir 
Fortov, the then-incumbent president run- 
ning for reelection, also withdrew on the 
opening day of the meeting. “The difference 
is, back then everyone withdrew and elec- 
tions were postponed,” says Askold Ivantchik, 
an RAS member and historian at the RAS 
Institute of World History and France’s na- 
tional research agency, CNRS. Six months 
later, Sergeev won, with Krasnikov coming 
in third. This time, Ivantchik says, “The door 
was left open for the candidate preferred by 
the government.” 

Speaking the day before elections, Yuri 
Solomonov, chief designer for ballistic missiles
at the Moscow Institute of Thermal Technology,
threw his support behind Krasnikov
because of what he said were “unfortunate” 
signals that Sergeev didn’t have the support 
of the country’s leadership. (Solomonov had 
endorsed Sergeev in 2017.) Krasnikov, 64, is a 
lifelong Mikron employee, and in 2016, after
Russia’s annexation of Crimea had triggered
sanctions, Putin put him in charge of
electronics as a national technological priority.

Gennady Krasnikov (left), with Russian President
Vladimir Putin in 2015, is the new RAS president.

Markovich, Krasnikov’s rival, was an un- 
usually progressive candidate who talked 
about issues that rarely get attention in Rus- 
sian science, such as gender imbalances. He 
used his final campaign speech to spotlight 
a scientist who he said was “mistakenly” 
charged with treason. (Since 2000, several 
dozen scientists, often working in fields re- 
lated to hypersonic weapons, have been 
charged with treason and jailed.) 

Irina Dezhina, a science policy researcher 
at the Institute for the Economy in Transi- 
tion, says the election chaos will be “a blow” 
to RAS. The academy has always been “en- 
tirely a creation of the state,” which pays its 
bills, Gordin notes. But in the turbulent 1990s 
it gained unprecedented influence over the 
nation’s research budgets, which Putin then 
reversed in a 2013 change that stripped RAS 
of most of its control over research institutes. 
“It is now a shadow of its former self in terms
of power and social influence,” Gordin says.

At his first press briefing as president- 
elect, Krasnikov said the academy should be 
brought back into high-level decisions about 
science funding and policy. This echoes an 
academy call, published in August in its an- 
nual “state of the science” report, for a big- 
ger role in the “operational management” of 
science. Dezhina thinks the government will 
ignore the vague calls and adds, “We've got 
enough managers in science, it’s the man- 
aged that are the issue, with the number of 
researchers falling every year.” R&D budgets
in Russia have stagnated at about 1% of gross 
domestic product, well below the average for 
developed nations. 

The invasion of Ukraine has isolated Rus- 
sian science and triggered wide-ranging 
sanctions—including measures against 
Mikron. It has also fractured the scientific 
community. In his concluding remarks be- 
fore the vote, Markovich wished for peace, a 
risky stance: Just 2 days later, public antiwar 
displays led to the detention of more than 
1400 street protesters across Russia—with 
some reportedly drafted into the war effort 
on the spot. 

In his election platform, Krasnikov blamed 
the recent technological constraints crippling 
Russian science on Western sanctions rather 
than Russia’s own actions. After his election, 
he also said he would prioritize “technological 
sovereignty”—a cause touted by Putin, who
formally relieved Sergeev of his duties and 
confirmed Krasnikov this week. 


Olga Dobrovidova is a journalist who does climate 
communications work in Paris. 
SCIENCE AND SECURITY


Fraud charges crumble in China Initiative cases 


Judges reject claims that defendants defrauded agencies by not disclosing China ties 


By Jeffrey Mervis 


The U.S. government overplayed its
hand in prosecuting U.S. academ- 
ics under the controversial China 
Initiative, three federal courts ruled 
last week. 

In separate cases, attorneys for the 
Department of Justice (DOJ) had main- 
tained that chemist Franklin Tao, materials 
scientist Zhengdong Cheng, and mathemati- 
cian Mingqing Xiao jeopardized the nation’s 
security and defrauded the government by 
deliberately hiding ties to Chinese institu- 
tions from the federal agencies 
funding their research. But last 
week, judges in Kansas, Texas, 
and Illinois either invalidated 
some of the most serious charges 
or handed down relatively lenient 
sentences for lesser violations. 
One judge overturned Tao’s fraud 
convictions, another accepted a 
plea deal that dropped nine fraud 
charges against Cheng, and the 
third sentenced Xiao to proba- 
tion rather than prison for failing 
to report a foreign bank account. 

Legal experts say DOJ’s theory 
of what constitutes defrauding a 
funding agency has turned out 
to be untenable. “These [results] 
show that the government’s [un- 
derlying] theory of the case was 
questionable,” says Margaret Lewis, a law 
professor and China scholar at Seton Hall 
University who doubts prosecutors should 
have pursued criminal convictions for what 
are often treated as civil or administrative vi- 
olations. “The primary duty of a prosecutor is 
to obtain justice, not a conviction,” she says.

DOJ issued a statement after Xiao was 
sentenced saying the department had “rec- 
ommended a sentence it believed appropri- 
ate under the law and facts of the case. [But] 
we respect the court’s decision.” It declined 
comment on the two other cases. 

Launched by then-President Donald 
Trump’s administration in 2018 to thwart 
economic espionage, the China Initiative led 
to charges against some two dozen academic 
scientists. Most were of Chinese ancestry, 
which civil rights groups said suggested anti- 
Asian bias. The initiative had a mixed track 
record: Although prosecutors won guilty 
pleas and prison terms for some defendants, 
they later dropped charges against many others
or failed to win jury verdicts.

Last week brought new setbacks for DOJ. 
In the case of Tao, who is on unpaid leave 
from the University of Kansas, Lawrence, a 
jury in April found him guilty on one count of 
making a false statement by filing incomplete 
paperwork and three counts of fraud, while 
acquitting him of four related fraud charges. 
Prosecutors had argued that Tao committed 
fraud by hiding his work with Chinese insti- 
tutions from his university and two funding 
agencies: the National Science Foundation 
and the Department of Energy. 


At a March protest, Hong Peng (right) called for justice for her husband, Franklin Tao.


But last week U.S. District Judge Julie 
Robinson took the unusual step of throw- 
ing out the three fraud convictions. In her 
61-page decision, Robinson noted that the 
U.S. Supreme Court has held fraud requires
evidence that the defendant deprived some- 
one of money or property. But Tao had in fact 
delivered the research he had promised to his 
funders and to his university, she wrote. “The 
evidence presented at trial showed that all 
three received what they bargained for.”

“In any other case, the government would 
never have thought to bring the fraud charge,” 
asserts Peter Zeidenberg, Tao’s attorney. “But 
when it comes to Chinese American scien- 
tists and on anything related to China, [pros- 
ecutors] haven’t applied the normal filtering 
process that they go through when deciding 
what is a meritorious case.” 

Tao is scheduled to be sentenced in Janu- 
ary on the sole remaining count. Robinson’s 
decision, meanwhile, echoes similar findings
by judges in other China Initiative cases.

In September 2021, for example, U.S. 
District Judge Thomas Varlan acquitted 
mechanical engineer Anming Hu of the Uni- 
versity of Tennessee, Knoxville, of six charges 
of defrauding NASA. “No rational jury could 
conclude that [Hu] acted with a scheme to 
defraud NASA by failing to disclose his affili- 
ation” with a Chinese university, Varlan wrote 
in his 52-page ruling. 

In April, U.S. District Judge Staci Yandle 
threw out two fraud charges against Xiao, a 
math professor at Southern Illinois Univer- 
sity (SIU), Carbondale. A jury then acquitted 
him of making false statements 
but found him guilty of failing to 
disclose a foreign bank account on 
his tax returns. 

Last week, however, the judge 
rejected the government’s request 
that Xiao receive a 1-year prison 
sentence and a heavy fine. Instead, 
she ordered Xiao to serve 1 year 
of probation and pay a $600 fine. 
Now on paid administrative leave, 
Xiao hopes to be reinstated by SIU. 

The tax charges “were more 
worthy of civil remedies—if they 
were worthy of any enforcement,”
says Xiao’s lead attorney, Ryan 
Poscablo of Steptoe & Johnson. 

Cheng, a former materials sci- 
entist at Texas A&M University 
(TAMU), College Station, was ar- 
rested and jailed in August 2020 on charges 
of making false statements and nine counts of 
fraud relating to a NASA grant. But last week 
the government dropped the fraud counts. 
Cheng then pleaded guilty to two counts of 
making false statements; he agreed to repay 
NASA $86,876 and pay a fine of $20,000. The 
two sides agreed on a prison sentence of time 
served; Cheng had already spent 13 months 
behind bars. (TAMU terminated Cheng in 
December 2020.) 

It’s not clear how many active China 
Initiative cases remain. In February, DOJ 
announced it was renaming the initiative be- 
cause of the perception that the phrase has 
had a “chilling effect on U.S.-based scientists 
of Chinese origin [and] fueled a narrative of 
intolerance and bias.” But Zeidenberg says 
he hasn’t seen any substantive change in 
the government’s approach to what it now 
calls “a strategy for countering nation-state 
threats” to the United States.
COVID-19


New Omicron strains may 
portend big COVID-19 waves 


Emerging subvariants are more immune evasive than ever 


By Gretchen Vogel 


Nearly 3 years into the pandemic,
SARS-CoV-2 faces a formidable chal- 
lenge: finding new ways around the 
immunity humans have built up 
through vaccines and countless in- 
fections. Worrisome new data show 
it is up to the challenge. Several new and 
highly immune-evasive strains of the virus 
have caught scientists’ attention in recent 
weeks; one or more may well cause big, new 
COVID-19 waves this fall and winter. 

“We can say with certainty that something 
is coming. Probably multiple things are com- 
ing,” says Cornelius Roemer, who studies viral 
evolution at the University of Basel. Whether 
they will also lead to many hospitalizations 
and deaths is the big question. 

“It’s not surprising that we're seeing 
changes that yet again help the virus to 
evade immune responses,” says molecular 
epidemiologist Emma Hodcroft of the Uni- 
versity of Bern, who notes that SARS-CoV-2 
faces “the same challenge that things like the 
common cold and influenza face every year— 
how to make a comeback.” 

The strains that look poised to drive the 
latest comeback are all subvariants of Omi- 
cron, which swept the globe over the past 
year. Several derived from BA.2, a strain that 
succeeded the initial BA.1 strain of Omicron 
but then was itself outcompeted in most 
places by BA.5, which has dominated in recent
months. One of these, BA.2.75.2, seems
to be spreading quickly in India, Singapore, 
and parts of Europe. Other new immune- 
evading strains have evolved from BA.5, in- 
cluding BQ.1.1, which has been spotted in 
multiple countries around the globe. 

Despite their different origins, several of 
the new strains have chanced upon a similar 
combination of mutations to help scale the 
wall of immunity—a striking example of con- 
vergent evolution. They all have changes at 
half a dozen key points in the viral genome 
that influence how well neutralizing anti- 
bodies from vaccination or previous infec- 
tion bind to the virus, says evolutionary bio- 
logist Jesse Bloom of the Fred Hutchinson 
Cancer Center. 

To quickly gauge how well any new sub- 
variant may evade immunity, researchers 
make copies of the viruses’ spike proteins and 
expose them to monoclonal antibodies or sera 
from people to measure how well the anti- 
bodies can block the variants from infecting 
cells. Using such tests, researchers in China 
and Sweden have found that spike protein 
from BA.2.75.2 can effectively evade nearly all 
the monoclonal antibodies used for treating 
COVID-19, suggesting these treatments may 
become useless. 

A woman receives an Omicron-specific booster
at a pharmacy in Schwenksville, Pennsylvania.

Both groups also found that BA.2.75.2
seems very good at evading immunity in
humans. In a preprint posted on 19 September,
immunologist Ben Murrell at the Karolinska
Institute and his colleagues reported
that serum samples from 18 blood donors 
in Stockholm—where vaccination rates are 
high and prior infections widespread—were 
less than one-sixth as effective at neutralizing 
BA.2.75.2 compared with BA.5. “This is the 
most resistant variant we’ve ever evaluated,” 
says Karolinska virologist Daniel Sheward. 

Immunologist Yunlong Richard Cao at 
Peking University and his colleagues found 
similar results for BA.2.75.2 after testing 
blood samples from 40 people who had been 
vaccinated with three doses of CoronaVac, 
a vaccine made from inactivated virus, and 
100 more who had been vaccinated and then 
had breakthrough infections with BA.1, BA.2, 
or BA.5. The team found that BQ.1.1 had a 
similarly striking ability to evade antibodies. 

In their preprint, updated on 23 Septem- 
ber, Cao and his colleagues also report that 
the new variants do not seem to have lost 
any ability to bind tightly to the receptor 
on human cells that the virus uses to infect 
them, which means the variants’ infectious- 
ness has likely not decreased. And they re- 
port some evidence that infections with the 
variants trigger proportionally more of the 
wrong types of antibodies—those that bind 
tightly to the virus but don’t blunt its abil- 
ity to infect cells. All of that could portend 
a massive new wave, Cao says. “The scale of 
immune evasion has never been seen before, 
and the virus is still rapidly evolving,” he says. 
“It’s very bad.”

Sheward and Murrell agree we should ex- 
pect lots of infections in the next few months, 
as happened last winter when Omicron en- 
tered the scene. But they’re less pessimistic 
than Cao, noting that many more people have 
recovered from an infection now or have re- 
ceived additional vaccine doses, including 
Omicron-specific boosters, whose rollout 
began this month. Those will boost overall 
antibody levels and will likely broaden the 
antibody repertoire, Sheward says: “I don’t 
think we’re quite back to square one.” 

“The choice to put BA.5 in the vaccine 
booster is still looking like a good one,”
Bloom adds. “The boosters are always going 
to be a step behind, but the good news is that 
the BA.5 booster is going to be one or two 
steps behind the virus’ evolution, instead of 
five steps behind.” 

Just how brutal a comeback the corona- 
virus has managed will become clear once 
more people become infected with the new 
strains. The next wave may also provide bet- 
ter clues about what factors trigger or pre- 
vent severe disease, Murrell says: “I think 
we're going to learn a lot this winter.” 


CLASH OF THE TITANS


The United States and Japan are embarking on ambitious efforts to wring a key 
secret of the universe from the subatomic phantoms known as neutrinos 

By Adrian Cho


Among physicists, those studying
elusive particles called neutrinos 
may set the standard for dogged 
determination—or obdurate stub- 
bornness. For 12 years, scientists 
in Japan have fired trillions of 
neutrinos hundreds of kilo-
meters through Earth to a gigan-
tic subterranean detector called
Super-Kamiokande (Super-K) to study 
their shifting properties. Yet the nearly 
massless particles interact with other mat- 
ter so feebly that the experiment, known as 
T2K, has captured fewer than 600 of them. 

Nevertheless, so alluring are neutrinos 
that physicists are not just persisting, they 
are planning to vastly scale up efforts to 
make and trap them. At stake may be in- 
sight into one of the most profound ques- 
tions in physics: how the newborn universe 
generated more matter than antimatter, 
so that it is filled with something instead 
of nothing. 

That prospect, among others, has 
sparked a race to build two massive sub- 
terranean detectors, at costs ranging from 
hundreds of millions to billions of dol- 
lars. In an old zinc mine near the former 
town of Kamioka in Japan, physicists are 
gearing up to build Hyper-Kamiokande 
(Hyper-K), a gargantuan successor to 
Super-K, which will scrutinize neutrinos 
fired from a particle accelerator at the
Japan Proton Accelerator Research Com- 
plex (J-PARC) in Tokai 295 kilometers 
away. In the United States, scientists are 
developing the Deep Underground Neu- 
trino Experiment (DUNE) in a former 
gold mine in Lead, South Dakota, which 
will snare neutrinos from Fermi Na- 
tional Accelerator Laboratory (Fermilab) 
1300 kilometers away in Batavia, Illinois. 

Researchers with both experiments ac- 
knowledge they’re in competition—and that 
Hyper-K may have an advantage because it 
will likely start to take data a year or two 
before DUNE. Yet aside from their goals, 
“Hyper-K and DUNE are vastly different,” 
says Chang Kee Jung, a neutrino physicist 
at Stony Brook University and a T2K mem- 
ber who now also works on DUNE. 

Hyper-K, which will be bigger but 
cheaper than DUNE, represents the next 
in a series of ever larger neutrino detectors 
of the same basic design developed over 
40 years by Japanese physicists. It is all but 
certain to work as expected, says Masato 
Shiozawa, a particle physicist at the Uni- 
versity of Tokyo and co-spokesperson for 
the 500-member Hyper-K collaboration. 
“Hyper-K is a more established technology 
than DUNE,” he says. “That is why I pro- 
posed it.” 

DUNE will employ a relatively new tech- 
nology that promises to reveal neutrino 
interactions in stunning detail and allow 
physicists to test their understanding of the
particles with unprecedented rigor. “With-
out bragging too much, we are best in class,”
says Sergio Bertolucci, a particle physicist
at the University of Bologna and Italy’s Na-
tional Institute for Nuclear Physics and co-
spokesperson for the 1300-member DUNE
collaboration. However, that technologi-
cal edge comes with a hefty price tag and,
Bertolucci acknowledges, more risk.


A cavern being excavated in Japan (left) will house Hyper-Kamiokande. It will be an even bigger version of the
famed Super-Kamiokande neutrino detector (right), a vast, water-filled tank lined with phototubes.

How the rivalry plays out will depend on 
factors as mundane as the cost of under- 
ground excavation and as exciting as the 
possibility that neutrinos, always quirky, 
hold some surprise that will transform 
physicists’ understanding of nature. 


THE MOST COMMON PARTICLES in the uni- 
verse besides photons, neutrinos exert no 
effect on the everyday objects around us. 
Yet they could carry clues to deep mys- 
teries. Neutrinos and their antimatter 
counterparts both come in three types 
or flavors—electron, muon, and tau— 
depending on how they’re generated. For 
example, electron neutrinos emerge from 
the radioactive decay of some atomic nu- 
clei. Muon neutrinos fly from the decays 
of fleeting particles called pi-plus mesons, 
which can be produced by smashing a 
beam of protons into a target. These iden- 
tities aren’t fixed: A neutrino of one type 
can change into another, chameleonlike, as 
it zips along at near-light-speed. 

Weirdly, a neutrino of a definite flavor has 
no definite mass. Rather, it is a quantum 
mechanical combination of three differ- 
ent “mass states.” For example, a decaying 
pi-plus spits out the combination of mass 
states that makes a muon neutrino. How- 
ever, like gears turning at different speeds, 
the mass states evolve at different rates, 
changing that combination. So, a particle 
that began as an electron neutrino might 
later appear as a tau neutrino—a phenom- 
enon known as neutrino oscillation. 
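
The gear analogy can be made concrete with a toy calculation. The sketch below is a simplification that assumes only two flavors and two mass states, whereas the real picture involves three; the function name, the mixing angle, and the sample phases are illustrative choices, not values from any experiment. It lets each mass state accumulate its own phase and returns the chance that the particle then shows up as the other flavor.

```python
import cmath
import math


def appearance_probability(theta, phase_1, phase_2):
    """Chance that a neutrino born as flavor A is detected as flavor B.

    theta   -- mixing angle between the two flavors, in radians (assumed)
    phase_1 -- phase accumulated by mass state 1, in radians
    phase_2 -- phase accumulated by mass state 2, in radians
    """
    # Flavor A starts as cos(theta)|1> + sin(theta)|2>. Each mass state then
    # picks up its own phase, like gears turning at different speeds.
    amp_1 = math.cos(theta) * cmath.exp(-1j * phase_1)
    amp_2 = math.sin(theta) * cmath.exp(-1j * phase_2)
    # Flavor B is the orthogonal combination -sin(theta)|1> + cos(theta)|2>.
    amp_b = -math.sin(theta) * amp_1 + math.cos(theta) * amp_2
    return abs(amp_b) ** 2


if __name__ == "__main__":
    theta = math.pi / 4  # hypothetical "maximal" mixing, for illustration only
    for phase_gap in (0.0, math.pi / 2, math.pi):
        p = appearance_probability(theta, 0.0, phase_gap)
        print(f"phase gap {phase_gap:.2f} rad -> appearance probability {p:.3f}")
```

In this toy model the result depends only on the gap between the two phases: when the mass states are in step, the neutrino keeps its original flavor, and the chance of a switch peaks when they are half a turn apart.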

Theorists can explain all of this with 
a mathematical clockwork known as the 
three-flavor model. It has just a hand- 
ful of parameters: roughly speaking, 
the probabilities with which one fla- 
vor will oscillate into another and 
the differences among the three 
mass states. The picture has gaps. 
Experiments show two mass states 
are close, but not whether the two 
similar states are lighter or heavier 
than the third—a puzzle known as 
the hierarchy problem. 

Moreover, neutrinos and anti-
neutrinos might oscillate by differ-
ent amounts, an asymmetry called
charge-parity (CP) violation. Mea-
suring that asymmetry is the prize
physicists seek, as it could help explain
how the soup of fundamental particles in
the early universe generated more matter
than antimatter.


1482 30 SEPTEMBER 2022 • VOL 377 ISSUE 6614


Neutrinos in high-def


Comprising two rectangular tanks, each filled with 17,000 tons of liquid argon, DUNE will precisely track all the
charged particles produced when a high-energy neutrino fired from a distant laboratory strikes an argon nucleus.

The charged particles will liberate
electrons, which an electric field will
push to a grid of wires. By timing when
each wire fires, physicists can infer the
particles’ tracks and, ultimately, the
incoming neutrino’s energy.

The wispy neutrinos themselves did not 
tilt the matter-antimatter balance. Rather, 
according to some theories, the familiar 
neutrinos are mirrored by vastly heavier 
“sterile” neutrinos that would interact with 
nothing except neutrinos. If sterile neutri- 
nos and antineutrinos also behave asym- 
metrically, then in the early universe their 
decays could have generated more elec- 
trons than antielectrons (also called posi- 
trons), seeding the dominance of matter. 

Seeing CP violation in ordinary neutrinos 
wouldn’t prove this scenario played out, 
notes Patrick Huber, a theorist at Virginia 
Polytechnic Institute and State University. 
But not seeing CP violation among ordinary 
neutrinos would render it much less likely 
that the hypothetical heavyweights possess
the key asymmetry, Huber says. “It’s not im-
possible, but it’s implausible,” he says.


In MicroBooNE, a small liquid-argon detector at Fermilab, an energetic
neutrino spawns charged particles, including an electron (long track).

But, first, scientists must determine 
whether neutrinos really exhibit this asym- 
metry. The teams in Japan and the United 
States will both employ a well-established 
technique to probe neutrino behavior. By 
smashing energetic protons from a parti- 
cle accelerator into a target to produce pi- 
pluses, they will generate a beam of muon 
neutrinos and shoot it toward a distant 
underground detector. There, researchers 
will count the surviving muon neutrinos and 
the electron neutrinos that have emerged 
along the way. Then they will switch to pro- 
ducing a beam of muon antineutrinos, by 
collecting pi-minuses instead of pi-pluses 
from the target. They’ll repeat the measure- 
ments, looking for any differences. 

The experiment is much harder than it 
sounds, as several other factors could cre- 
ate a spurious asymmetry. For example, the 
neutrino and antineutrino beams 
will inevitably differ slightly, both 
in intensity and in their energy 
spectrum. To account for such dif- 
ferences, the researchers must sam- 
ple the particles as they start their 
journey by placing a small detector, 
preferably with a design as similar 
as possible to the distant detector, in 
front of the beam source. 

The physics of neutrinos them-
selves could also skew the results. For
example, either neutrinos or anti- 
neutrinos will be absorbed more 
strongly by the matter they traverse 
on their flight to the detector. The
direction of that effect depends on the so-
lution to the hierarchy problem. So, to spot
CP violation, physicists will most likely
have to solve the hierarchy problem, too.


Ring around a neutrino


An underground tank filled with 260,000 tons of water and lined with photodetectors, Hyper-Kamiokande will
detect neutrinos shot from hundreds of kilometers away by exploiting the optical equivalent of a sonic boom.


The biggest barrier to sorting all of this
out, however, has been the measly harvest
of neutrinos from even the biggest experi-
ments. Like their counterparts in Japan, U.S.
physicists already have a neutrino-oscillation
experiment, NOvA, which shoots neutrinos
from Fermilab to a detector 810 kilometers
north in Minnesota. Like T2K, it has netted
just several hundred neutrinos.


HYPER-K WILL TACKLE the scarcity primarily
by providing a much bigger target for the
neutrinos to hit. Proposed a decade ago,
it’s a scaled-up version of the storied Super-
K detector and will consist of a cylindrical
stainless steel tank 78 meters tall and 74 me-
ters wide, holding 260,000 tons of ultrapure
water—five times as much as Super-K.

To spot neutrinos, the detector will rely
on the optical equivalent of a sonic boom.
Rarely, a muon neutrino zipping through
the water will knock a neutron out of an
oxygen atom and change it into a proton,
while the neutrino itself morphs into a
high-energy muon. The fleeing muon will
actually exceed the speed of light in water,
which is 25% slower than in a vacuum, and
generate a shock wave of so-called Cheren-
kov light, just as a supersonic jet creates a
shock wave of sound. That conelike shock
wave will cast a ring of light on the tank’s
side, which is lined with photodetectors.

Similarly, an electron neutrino can strike
a neutron to produce a high-speed electron,
which is lighter than a muon and will be
buffeted more by the water molecules. The
result will be a fuzzier light ring. Muon and
electron antineutrinos can spawn detect-
able antimuons and antielectrons by strik-
ing protons, although with about half the
efficiency of the neutrino interactions.


In Super-K, a muon neutrino turns into a muon, which
radiates a tidy light ring (top). An electron neutrino
spawns an electron and a fuzzier ring (bottom).


Hyper-K will be Japan’s third great de-
tector, all in the same mining area. From
1983 to 1995, the Kamioka Nucleon Decay
Experiment (Kamiokande), a 3000-ton de-
tector, tried to spot the ultrarare decays
of protons that some theories predict. In- 
stead, in 1987, it glimpsed neutrinos from 
a supernova—an advance that won a share 
of the Nobel Prize in Physics in 2002. In 
1996, Super-K came online. It proved neu- 
trinos oscillate by studying muon neutri- 
nos generated when cosmic rays strike 
the atmosphere. Fewer come up from the 
ground than down from the sky, showing 
that those traversing Earth change fla- 
vor along the way. The discovery shared 
the Nobel in 2015. “It’s spectacular what 
[Japanese physicists] have done,” says
Erin O’Sullivan, a neutrino astrophysi- 
cist at Uppsala University and a Hyper-K 
member who was drawn by “the dynasty 
of Super-K.” 

Hyper-K will reuse J-PARC’s neutrino 
beam, which is now being upgraded to in- 
crease its power by a factor of 2.5. Overall, 
it should collect data at 20 times the rate of 
T2K, says Stephen Playfer, a particle physi- 
cist at the University of Edinburgh and 
the University of Tokyo and the project’s 
lead technical coordinator. Before joining 
Hyper-K in 2014, he and his Edinburgh 
colleagues also considered joining DUNE. 
“When it came to comparing who was go- 
ing to be first to see something, we thought 
Hyper-K was in a good position, just be- 
cause it would have the statistics and it 
had a well-known technology,” he says. 

Hyper-K will have limitations. In par- 
ticular, it won’t measure the neutrinos’ en- 
ergies precisely. That matters because the 
rate at which a neutrino oscillates depends 
on its energy, and a beam contains neutri- 
nos with a range or spectrum of energies. 
Without a way to pinpoint each neutrino’s 
energy, the experiment would be unable to 
make sense of the oscillation rates. 

To avoid this problem, Hyper-K, like 
current experiments, will rely on a trick. 
A neutrino beam naturally diverges, with 
lower energy neutrinos spreading more 
than higher energy ones. Thus, if a detec- 
tor sits slightly to the side of the beam’s 
path, it will see neutrinos with a narrower 
range of energies that should oscillate at 
roughly the same rate. So, like Super-K, 
Hyper-K will sit off the beam axis by an 
angle of 2.5°.

Physicists can then tune the energy of 
the beam so the neutrinos reach the detec- 
tor when the oscillation is at its maximum. 
With the neutrinos’ energy constrained, 
physicists basically count the number of 
arriving muon neutrinos, electron neutri- 
nos, and their antimatter counterparts. 
Hyper-K’s CP measurement comes down to 
comparing two ratios: electron neutrinos 
with muon neutrinos and electron anti- 
neutrinos with muon antineutrinos. 
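
To see why tuning the beam energy matters, a back-of-the-envelope estimate helps. The sketch below relies on a standard two-flavor approximation in which the oscillation phase equals 1.267 × Δm² × L / E, with the mass-squared splitting Δm² in eV², the baseline L in kilometers, and the energy E in GeV. The splitting of 2.5 × 10⁻³ eV² is an assumed, typical value that does not come from the article, and the function names are illustrative. The baselines of 295 and 1300 kilometers are the ones quoted for Hyper-K and DUNE.

```python
import math

# Assumption: a typical mass-squared splitting in eV^2 (not given in the article).
DELTA_M2_EV2 = 2.5e-3


def oscillation_phase(baseline_km, energy_gev, delta_m2_ev2=DELTA_M2_EV2):
    """Two-flavor oscillation phase in radians (1.267 * dm^2 * L / E)."""
    return 1.267 * delta_m2_ev2 * baseline_km / energy_gev


def first_maximum_energy_gev(baseline_km, delta_m2_ev2=DELTA_M2_EV2):
    """Energy at which the phase reaches pi/2, the first oscillation maximum."""
    return 1.267 * delta_m2_ev2 * baseline_km / (math.pi / 2)


if __name__ == "__main__":
    for label, baseline_km in (("J-PARC to Kamioka", 295), ("Fermilab to Lead", 1300)):
        energy = first_maximum_energy_gev(baseline_km)
        print(f"{label}: {baseline_km} km, first maximum near {energy:.2f} GeV")
```

Under these assumptions the first maximum falls at roughly 0.6 GeV for the 295-kilometer baseline and roughly 2.6 GeV for the 1300-kilometer one, which is in line with the higher beam energies the longer baseline calls for.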


Since 2015, DUNE researchers have built prototypes at the European particle physics laboratory, CERN, which have performed even better than expected.


Workers have already begun the exca- 
vation for Hyper-K, which should take 
2 years, Shiozawa says. The whole project 
will cost Japan about $600 million, with 
international partners chipping in an ad- 
ditional $100 million to $200 million, 
he says. The detector will be complete in 
2027, Shiozawa says, and will start taking 
data a year later. So confident are Hyper- 
K researchers in their technology that they 
say the trickiest part of the project is the 
digging. “We need to construct 
probably the largest under-
ground cavern” in the world,
Shiozawa says. “In terms of tech- 
nology and also cost, this is the 
biggest challenge.” 


IF, TECHNOLOGICALLY, Hyper-K 
amounts to much more of the 
same, DUNE aims to be some- 
thing almost completely differ- 
ent. It will employ a technology 
that, until recently, was used in 
only one other large experiment 
but that should enable physicists 
to see neutrino interactions as 
never before. “To me, the draw 
of DUNE is its precision,” says
Chris Marshall, a particle physi- 
cist at the University of Rochester 
and DUNE’s physics coordinator. 
“This is an experiment that will 
be world leading in just about everything
that it measures.” 

Hunkering 1480 meters deep in a re- 
purposed gold mine, DUNE will consist 
of two rectangular tanks 66 meters long, 
19 meters wide, and 18 meters tall. Each 
will contain 17,000 tons of frigid liquid 
argon cooled to below -186°C. Just as in a 
water-filled detector, a neutrino can blast a 
neutron—in this case in an argon nucleus— 
to create a muon or an electron. But the
neutrinos reaching DUNE from Fermilab
will pack up to 10 times more energy than
those flowing to Hyper-K. So, in addition to
the muon or electron, a collision will typi-
cally produce a spurt of other particles such
as pions, kaons, protons, and neutrons.


Light glints off the planes of closely spaced, electron-catching wires within
a DUNE prototype. The wires are 150 micrometers thick, like heavy hair.

DUNE aims to track all those
particles—or at least the charged ones— 
with a technology called a liquid argon 
time projection chamber. As a charged 
particle streaks through the argon, it will 
ionize some of the atoms, freeing 
their electrons. A strong electric 
field will push the electrons side- 
ways until they hit three closely 
spaced planes of parallel wires, 
each plane oriented in a differ- 
ent direction. By noting when 
the electrons strike the wires, 
physicists can reconstruct with 
millimeter precision the origi- 
nal particle’s 3D trajectory. And 
from the amount of ionization it 
produces, they can determine its 
type and energy. 

The details are mind-boggling. 
The electrons will have to drift as 
far as 3.5 meters, driven by a volt- 
age of 180 kilovolts. And unlike 
Hyper-K, DUNE will sit directly 
in the beam from Fermilab. So, it 
will capture a bigger but messier 
harvest of neutrinos, with ener-
gies ranging from less than 1 giga-electron
volt to more than 5 GeV. 
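
The timing trick behind the time projection chamber boils down to distance equals speed times time. The sketch below uses the 3.5-meter drift and 180-kilovolt voltage quoted above and assumes a uniform electric field. The drift speed of 1.6 millimeters per microsecond is an assumed, typical figure for liquid argon at a field of about 500 volts per centimeter, not a number from the article, and the function names are illustrative.

```python
DRIFT_LENGTH_M = 3.5      # maximum drift distance quoted in the text
DRIFT_VOLTAGE_KV = 180.0  # drift voltage quoted in the text
# Assumption: typical electron drift speed in liquid argon near 500 V/cm.
ASSUMED_DRIFT_SPEED_MM_PER_US = 1.6


def drift_field_v_per_cm(voltage_kv=DRIFT_VOLTAGE_KV, length_m=DRIFT_LENGTH_M):
    """Average field strength, assuming the voltage drops evenly over the drift."""
    return voltage_kv * 1000.0 / (length_m * 100.0)


def drift_distance_mm(arrival_delay_us, speed_mm_per_us=ASSUMED_DRIFT_SPEED_MM_PER_US):
    """Distance from the wires at which the electrons were freed."""
    return arrival_delay_us * speed_mm_per_us


def max_drift_time_us(length_m=DRIFT_LENGTH_M, speed_mm_per_us=ASSUMED_DRIFT_SPEED_MM_PER_US):
    """Time electrons need to cross the full drift distance."""
    return length_m * 1000.0 / speed_mm_per_us


if __name__ == "__main__":
    print(f"average field: {drift_field_v_per_cm():.0f} V/cm")
    print(f"longest drift time: {max_drift_time_us():.0f} microseconds")
    print(f"electrons arriving 500 microseconds late started {drift_distance_mm(500.0):.0f} mm away")
```

With these inputs the field works out to a little over 500 volts per centimeter and the longest drift lasts about 2 milliseconds, so arrival times measured in microseconds translate into positions measured in millimeters.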

DUNE’s ability to precisely track all the 
particles should enable it to do something 
unprecedented in neutrino physics: Mea- 
sure the energy of each incoming neutrino 
to construct energy spectra for each flavor 
of neutrino and antineutrino. Because of 
the flavor changing, a plot of each spec- 
trum should itself exhibit a distinct wiggle 
or oscillation. By analyzing all of the spec- 
tra, physicists should be able to nail down 
the entire three-flavor model, including 
the amount of CP violation and the hier- 
archy, in one fell swoop, Bertolucci says. 
“It can measure all the parameters in the 
same experiment,” he says. 

Until now, the technology has never 
been fully developed. Italian Nobel laure- 
ate Carlo Rubbia dreamed up the liquid 
argon detector in 1977. But it wasn’t until 
2010 that one called ICARUS in Italy’s sub- 
terranean Gran Sasso National Laboratory 
snared a few neutrinos shot from the Eu- 
ropean particle physics laboratory, CERN, 
near Geneva, Switzerland. Researchers at 
Fermilab and CERN have embarked on a 
crash program to build prototypes, which 
have worked even better than expected, 
says Kate Scholberg, a neutrino physicist 
and a DUNE team member at Duke Uni- 
versity. “It’s kind of an entrancing thing to 
look at the event displays” coming in, she 
says. “It’s just incredible detail.” 

That precision comes at a price. For ac- 
counting purposes, the U.S. Department of 
Energy (DOE) splits the project in two. One 
piece, the Long Baseline Neutrino Facility 
(LBNF), includes the new neutrino beam 
at Fermilab and all the infrastructure. The 
second, DUNE, is an international collabo- 
ration that will build just the guts of the 
detectors. In 2015, DOE estimated LBNF/ 
DUNE would cost $1.5 billion and come 
online in 2027. Last year, however, DOE re- 
ported that unexpected construction costs 
had raised the bill to $3.1 billion. The de- 
tector should be completed in 2028, says 
Christopher Mossey, project director for 
LBNF/DUNE-U.S. But the beam will lag 
until early 2031, potentially giving Hyper-K
a head start of more than 2 years.

With contracts in hand and construction 
underway, DUNE developers are confident 
that the new cost and timeline will hold. 
Excavation has passed 40% and should 
be completed in May 2023, Mossey says. 
“We really are accomplishing big, tangible 
things.” Still, DUNE physicists acknowl- 
edge that the project is riskier than Hyper- 
K. “It’s a leap into the unknown, and that’s 
the trade-off you make,” Scholberg says. 
“Something that is more transformative is 
going to certainly entail more scariness.” 


BOTH EXPERIMENTS have other scientific
goals, such as searching for proton decay. 
There, Hyper-K has an advantage, Huber 
says, as it is simply bigger and contains 
lots of lone protons in the hearts of the hy- 
drogen atoms in water molecules. Another 
tantalizing payoff could come if a giant star 
collapses and explodes as a supernova near 
our Galaxy, as one did in 1987. The experi- 
ments would provide complementary ob- 
servations, Scholberg says, as DUNE would 
see the electron neutrinos produced just 
as the core implodes and Hyper-K would 
see mainly electron antineutrinos released 
later in the explosion. 


Hyper-Kamiokande will deploy new and improved phototubes, which must withstand pressures up to six
atmospheres at the bottom of the tank.


Nevertheless, the raison d’être for both
experiments remains deciphering neutrino
oscillations and searching for CP violation.
So, how would a gambler handicap this race?

Given their head start, Hyper-K physicists
could score major discoveries before DUNE
even finds its feet. If CP violation is as big
as it can possibly be, “then we may discover
it in 3 years,” Shiozawa says. “And also, we
may discover proton decay in 3 years.” But,
he says, “It really depends on nature.”

Hyper-K is optimized to measure CP vio-
lation assuming it’s big and the three-flavor
model is the final word on neutrino oscil-
lations, Huber notes. Neither assumption
may hold. And with its simpler counting
technique and shorter baseline, the experi-
ment may struggle to distinguish CP vio-
lation from the matter effect unless some
other experiment independently solves
the hierarchy problem. “Hyper-K certainly
requires more external inputs,” Huber says.
DUNE, in contrast, should be able to
disentangle the whole mess on its own.

Shiozawa, for one, is not counting out his
rival. The Japanese project has experienced
growing pains of its own, he notes, including
being scaled back from an initial 1-million-
ton design. And the Japanese government
won’t countenance any cost increase, put-
ting project leaders in constant tension
with contractors, he says. “The situation is
not so different between the two projects.”

Ultimately, the rivalry between Hyper-
K and DUNE may be less a dash for glory
than a decadeslong slog through uncer-
tainty. If so, the two teams could end up
collaborating as much as they compete, at 
least informally. “We’ll have a long time 
where the most accurate results will come 
from a combination of the two [experi- 
ments],” Huber predicts. 

Most tantalizing, instead of completing 
the current theory, the results could upend 
it. They might reveal deviations from the 
three-flavor model that could hint at new 
particles and phenomena lurking in the 
vacuum. After all, neutrinos have repeat- 
edly surprised physicists, who once as- 
sumed that the particles came in only one 
type and were completely massless and in- 
ert. “Previously, neutrino experiments have 
taught us that we very rarely take data in 
a neutrino beam and get exactly what we 
expect,” Marshall says. 

The unexpected may be a long shot 
worth betting on. 


METHANE EMISSION


Plant et al. found that the 
efficiency of methane removal 
by flaring at US oil and gas 
facilities is lower than expected. 


Tackling unlit and inefficient gas flaring 


Emissions from flaring threaten the global climate and the health of local communities 


By Riley Duren1,2 and Deborah Gordon3,4


1Carbon Mapper, Inc., Pasadena, CA, USA. 2Arizona
Institute for Resilient Environments and Societies and
Research, Innovation and Impact, University of Arizona,
Tucson, AZ, USA. 3RMI, Basalt, CO, USA. 4Watson Institute
for International and Public Affairs, Brown University,


Because of the outsized warming po-
tential of methane, it is essential to
limit this superpollutant to mitigate
the worsening global climate emer-
gency. More than 120 nations have
signed the Global Methane Pledge to
cut emissions by 30% by 2030 (1). Some 80
oil and gas (O&G) companies have joined
the Oil and Gas Methane Partnership,
which aims to establish a reporting frame-
work for methane emissions (2), such as
those from flaring—the practice of burn- 
ing off unwanted gas. The World Bank has 
enlisted public and private stakeholders 
to commit to zero routine flaring by 2030 
(3). On page 1566 of this issue, Plant et al. 
(4) report an average methane destruction 
removal efficiency of 87 to 94% by flaring 
in US-based O&G facilities. This number is 
much lower than what was previously as- 
sumed, and the authors attribute this to 
inefficient and unlit flares. 

Flares are designed into O&G facilities 
as a safety precaution and should only be 
used in emergencies. Beyond safety, satel-
lite data have shown that flaring may be
resorted to for various reasons throughout
O&G facilities globally (5). In 2021, an es-
timated 144 billion m³ of gas was released
through flaring—enough to generate two- 
thirds of the European Union’s electricity 
needs. Currently, most O&G facilities pro- 
vide no direct tracking of the methane re- 
moval efficiency (or even the operational 
status) of their flares. Until recently, it 
was assumed that most flares performed 
at nearly 100% efficiency, such that com- 
bustion effectively converted all of the 
flared gas into water and carbon dioxide 
(6). However, Plant et al. and other recent 
studies are providing a reality check on this 
overly optimistic view of flaring efficiency. 

The authors studied more than 300 flares 
from three major O&G
production basins respon- 
sible for 80% of flaring in 
the US. The finding that a 
small number of inefficient 
or unlit flares are respon- 
sible for much higher than 
predicted emissions is not 
new. For instance, an ear- 
lier study found that ~1000 
methane superemitters in 
the Permian basin (in New 
Mexico and Texas) were 
responsible for 60% of the 
region’s methane emis- 
sions, with 12% of those 
emissions coming from 
inefficient flares, most of 
which were persistently 
unlit (7). The findings of 
Plant et al. reveal a broader 
trend for flaring across the 
US, which might be representative of the 
situation globally, indicating the need for 
broader observational studies. They also 
found that methane emissions from flar- 
ing in the US are five times as high as cur- 
rently reported in the US Environmental 
Protection Agency’s (EPA) greenhouse gas 
inventory. 
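
A few points of efficiency matter more than they might seem, because what counts for the climate is the fraction of methane that slips through unburned. The sketch below compares the 87 to 94% range reported by Plant et al. with an assumed baseline of 98%. That baseline is a stand-in for the "nearly 100%" assumption and is hypothetical, not a figure from the article, and the function names are illustrative.

```python
# Assumption: a stand-in for the "nearly 100%" efficiency previously assumed.
ASSUMED_BASELINE_EFFICIENCY = 0.98


def methane_slip(efficiency):
    """Fraction of the methane sent to a flare that escapes unburned."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return 1.0 - efficiency


def slip_ratio(observed_efficiency, assumed_efficiency=ASSUMED_BASELINE_EFFICIENCY):
    """How many times more methane escapes than the assumed efficiency implies."""
    return methane_slip(observed_efficiency) / methane_slip(assumed_efficiency)


if __name__ == "__main__":
    for observed in (0.87, 0.94):
        print(f"observed efficiency {observed:.0%}: "
              f"{slip_ratio(observed):.1f} times the assumed methane slip")
```

Under that assumed baseline, efficiencies of 94% and 87% correspond to about three and about six and a half times as much escaping methane, respectively. This is only an illustration of the arithmetic and does not reproduce the inventory comparison made by the authors.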

The International Energy Agency (IEA) 
estimated global average flaring efficiency 
at 92% in 2020 on the basis of satellite- 
derived flaring volumes and bottom-up op- 
erating data (8). In total, IEA estimated 8 
million metric tons (Mt) of methane emit-
ted from global flaring in 2020—equiva-
lent to 660 Mt of CO₂ over 20 years (9).
This is as large an emission footprint as 
that from half of all the cars in the US. To 
make matters worse, this estimate may 
be lower than reality because unlit flares 
are mostly unaccounted for. In addition to 
their global warming contribution, inef- 
ficient and unlit flarings also present air 
quality, health, and environmental justice 
concerns for local communities. Among 
the flares monitored by Plant et al., only 55 
to 76% of the gas sent to flares was meth- 
ane, and the remainder likely included 
volatile organic compounds and hazardous 
air pollutants, such as benzene. In the US, 
an estimated 18 million people live within 
2 km of an O&G well (10), and ~500,000 
people live within 5 km of an active flare 
in the Permian basin, the Eagle Ford basin 
(in Texas), and the Bakken basin (in North 
Dakota and Montana) (11). Unlit and in-
completely combusted flares may expose
nearby communities to a cocktail of copol- 
lutants that pose health risks. 
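
The conversion between methane and carbon dioxide quoted above can be unpacked with simple arithmetic. The sketch below derives the 20-year equivalence factor implied by the two IEA figures in the text (8 Mt of methane, 660 Mt of CO₂) and applies it to a purely hypothetical extra megaton from unlit flares; the function names and the extra megaton are illustrative assumptions.

```python
METHANE_FROM_FLARING_MT = 8.0  # IEA estimate quoted in the text
CO2_EQUIVALENT_MT = 660.0      # 20-year CO2 equivalent quoted in the text


def implied_co2e_factor(co2e_mt=CO2_EQUIVALENT_MT, methane_mt=METHANE_FROM_FLARING_MT):
    """Tons of CO2 treated as equivalent to one ton of methane over 20 years."""
    return co2e_mt / methane_mt


def co2_equivalent_mt(methane_mt, factor):
    """Convert a methane mass in Mt to its CO2-equivalent mass in Mt."""
    return methane_mt * factor


if __name__ == "__main__":
    factor = implied_co2e_factor()
    print(f"implied 20-year factor: {factor:.1f} t CO2 per t methane")
    # Hypothetical: one additional Mt of methane from unaccounted unlit flares.
    extra = co2_equivalent_mt(1.0, factor)
    print(f"each extra Mt of methane adds about {extra:.1f} Mt of CO2 equivalent")
```

The two quoted numbers imply a factor of 82.5, so every additional megaton of methane missed by the estimate would add about 82.5 Mt of CO₂ equivalent on a 20-year basis.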

A range of factors, including design 
flaws, perverse policies, operational neg-
ligence, and regulatory and enforcement
gaps, all contribute to unlit and inefficient
flaring. Although there are engineering so-
lutions to increase the efficiency of flares,
this problem largely lies with the lack of
regulation and oversight. In some parts of
the US, North Dakota for example, allow-
able efficiency for flaring is set as low as
90% (12). Exemptions in state and federal
rules also allow routine flaring when “no
sales line is available” which means that
operators can simply burn their excess pro-
duction as waste. In the Permian basin, for
example, where pipeline capacity is limited,
O&G operators routinely opt to burn the gas
rather than sell it.


Methane plume from an unlit flare in the Permian basin is detected by a remote sensing aircraft.

Better ways to monitor methane emis- 
sions may also be needed for operators and 
regulators to comply with and enforce the 
stricter emission standards if they are up- 
dated. Existing satellites can see the heat 
signatures of active flares but are mostly 
unable to detect unlit flares and have lim- 
ited ability to directly quantify methane 
removal efficiency. Future satellites, such as 
the planned Carbon Mapper constellation 
(13), will offer more-comprehensive global 
flare tracking by having lower methane 
detection limits, higher spatial resolution, 
larger area coverage, and more-frequent 
sampling. Newer satellites, along with 
ground- and air-based monitoring systems, 
can be coordinated to focus on regions 
with the greatest risk for flaring issues. 
Monitoring systems can help direct fur- 
ther research studies to better characterize 
regional variations in flaring performance 
and evaluate previously unmonitored fa- 
cilities in remote areas. Actionable methane 
data, when combined with rapid notifica- 
tion protocols, can alert operators for the 
expedited repair of inefficient and unlit 
flares and other malfunctions, such as gas
and oil leaks. Requirements
for self-reporting from op- 
erators, especially during 
the initial months when 
a production well comes 
online and during disrup- 
tions, could also improve 
oversight and reduce waste. 

In addition to improved 
monitoring, there is a need 
for better models to guide 
improvements in flare 
design and operations.
Observational data can be 
integrated with open-source 
engineering models like 
the Oil Climate Index plus 
Gas, which estimates O&G 
flaring and total life cycle 
emissions (14). Together, 
satellites, surface sensors, 
and models can provide
more-accurate assessments of the role that 
improved flaring efficiency plays in overall 
O&G emissions and future mitigation efforts. 
The technology exists to prevent unlit and 
inefficient flares across O&G supply chains. 
Implementing flare fixes worldwide, how- 
ever, will require further improvements in 
monitoring, regulations, and industry prac- 
tices. As the global climate crisis continues to 
worsen, resolving the flaring problem in this 
decisive decade is among the easiest routes 
to help mitigate the problem.

Metabolite-driven antitumor immunity


An oncometabolite blocks T cell killing by inhibiting glycolysis 


By James A. Nathan 


Otto Warburg’s observations that solid
organ tumors preferentially use
aerobic glycolysis, metabolizing glu-
cose to lactate, to facilitate growth
paved the way for our understand-
ing of how nutrient availability and
metabolism can promote cancer. A common 
theme is that proliferating tumor cells use a 
diverse array of mechanisms to acquire nu- 
trients to grow rather than to simply support 
adenosine triphosphate (ATP) production 
(1). However, promoting cell growth is gener- 
ally not sufficient for solid-organ tumors to 
develop because they must also 
evade cytotoxic T cells, raising 
the question of whether this 
immune evasion is linked to 
metabolism. On page 1519 of 
this issue, Notarangelo et al. (2) 
show that D-2-hydroxyglutarate
(D-2HG), a metabolite secreted
by tumors with mutations
in isocitrate dehydrogenase
(IDH) (3), functions within
the tumor microenvironment (TME)
to impair CD8⁺ T cell-mediated
tumor cell killing. These findings highlight
the potential for metabolites
within the TME to alter antitumor
immune responses.


Somatic mutations in IDH 
occur in a wide range of tu- 
mors, including acute myeloid 
leukemia and ~80% of glio- 
mas, the most common form 
of brain cancer (4). Disease- 
causing mutations disrupt the 
canonical action of IDH to 
convert isocitrate to α-ketoglutarate (αKG)
in the tricarboxylic acid cycle and instead
promote the formation of D-2HG from αKG
(3). D-2HG acts as an “oncometabolite”
within cells, contributing to tumorigenesis
through inhibition of metabolite-sensitive
enzymes (αKG-dependent dioxygenases)
involved in modulating epigenetic marks 
and transcriptional reprogramming (5). 
However, D-2HG also accumulates to high 
levels within the TME (6). Notarangelo et
al. investigate how extracellular D-2HG
may be related to prior observations that 
IDH-mutant gliomas are protected from 
infiltrating T cells (6-8). 

Notarangelo et al. find that treating isolated
mouse CD8⁺ T cells with D-2HG results
in decreased proliferation, reduced lytic
granule release, and impaired secretion of
the cytokine interferon-γ (IFN-γ), leading
to a functional loss of cytotoxic killing in vitro
(see the figure). D-2HG needs to be present
at the time of tumor cell recognition by
CD8⁺ T cells to impair killing, and removal
of the metabolite restores T cell function. 


Suppressing T cell killing
D-2-hydroxyglutarate (D-2HG) is produced by isocitrate dehydrogenase (IDH) mutant
tumors and accumulates in the tumor microenvironment. D-2HG uptake by surveilling
CD8⁺ T cells inhibits lactate dehydrogenase (LDH), which decreases glycolysis and
disrupts the NAD⁺-NADH balance. This drives respiration and thereby increases
reactive oxygen species (ROS) produced by Complex I of the
electron transport chain. Together, this impairs T cell
proliferation and cytotoxic activity.

This rapidly reversible alteration in T cell 
function is not consistent with D-2HG-mediated
changes in epigenetic marks (5),
and no transcriptional reprogramming of
T cells was found within 24 hours of treatment.
Instead, the authors find that D-2HG
exerts its effects by perturbing T cell gly- 
colysis through direct inhibition of lactate 
dehydrogenase (LDH), which catalyzes the 
conversion of pyruvate to lactate, using re- 
duced nicotinamide adenine dinucleotide 
(NADH) as a cofactor. Subsequently, this alters
the balance between NAD⁺ and NADH,
which are important for maintaining redox
levels in cells.

ADP, adenosine diphosphate; ATP, adenosine triphosphate;
NAD⁺, nicotinamide adenine dinucleotide; NADH, reduced NAD⁺.


Prior studies have observed that D-2HG
uptake by tumor-infiltrating macrophages
and cytotoxic T cells can lead to suppression
of antitumor immunity by altered
tryptophan metabolism and calcium signaling
(6, 9). Notarangelo et al. show a distinct
mechanism of evading T cell killing,
demonstrating that regeneration of NAD⁺
pools by LDH activity is central to CD8⁺ T
cell function. They find that D-2HG treatment
of mouse CD8⁺ T cells impairs glycolysis,
resulting in a shift to respiration
and shuttling of cytosolic NADH to the mitochondria.
The resulting increase in activity
of Complex I helps sustain CD8⁺ T cell
proliferation when glycolysis
is impaired. Consistently,
Notarangelo et al. show that
treatment of cultured CD8⁺ T
cells with several LDH inhibitors
phenocopies the effects
of D-2HG treatment, and that
resetting the NAD⁺-NADH
balance in vitro, by using
a bacterial NADH oxidase
to generate NAD⁺ independently
of LDH, prevents the
shift to respiration.

The contribution of D-2HG 
to evading antitumor im- 
munity was evident in both 
mouse models and tissue
sections from patients with 
IDH-mutant tumors. Using 
single-cell studies and multi- 
plexed immunofluorescence, 
Notarangelo et al. find that 
human IDH-mutant gliomas 
demonstrate an impaired 
IFN-γ signature comparable
with that seen in mouse T cells
treated with D-2HG, and they observe exclusion
of CD8⁺ T cells from regions with high
D-2HG levels in IDH-mutant brain tumor
sections from patients. The timing of exposure
to D-2HG, and resulting LDH inhibition,
may dictate the consequences for metabolic
reprogramming of CD8⁺ T cells. Earlier work
that used a mouse model of melanoma found 
that pretreatment of T cells with an LDH in- 
hibitor before their transfer into mice bear- 
ing melanoma actually improved antitumor 
immunity (10). These differential effects of
LDH inhibition on cytotoxic T cells may need 
to be further resolved, but in the context of 
the TME, the findings of Notarangelo et al. 
reveal a rapid LDH-dependent rewiring of
CD8⁺ T cell function.

The clinical use of inhibitors targeting 
IDH mutations suggests that drugs that 
decrease D-2HG levels may also confer
clinical benefits through reversing T cell
dysfunction. To date, small-molecule mutant
IDH inhibitors are approved for clinical
use in acute myeloid leukemia (11) and
are being tested in IDH-mutant gliomas. 
Inhibitors of mutant IDH1 can improve an- 
titumor T cell responses in mouse models 
(6, 12), and it will be important to explore 
their potential enhancement of antitumor 
immunity in clinical disease. 

The study of Notarangelo et al. high- 
lights the role for metabolites in the TME 
to rewire immune cells but, more broadly, 
raises the question of how this rapid 
metabolic reprogramming integrates with
changes in epigenetic marks and immune 
cell function. In the context of D-2HG, this 
will be dependent on the relative con- 
tribution of active uptake and secretion, 
and how this flux is controlled in T cells 
remains to be fully determined. Moreover, 
the relative contribution to transcriptional 
programming will result from a complex 
interplay of the abundance of additional 
metabolites that can alter epigenetic 
marks (such as succinate, αKG, L-2HG,
and itaconate), other nutrients within the
TME, and oxygen availability. Notarangelo
et al. also find that the effects of D-2HG
on CD8⁺ T cells are independent of the
hypoxia-inducible factor pathway (which is
regulated by αKG-dependent dioxygenases
and transcriptionally up-regulates glycoly- 
sis), but how oxygen availability within the 
TME interplays with metabolic rewiring 
will be of interest to explore. As the tech- 
nical challenges in determining metabolite 
concentrations within the TME begin to be 
overcome, non-cell-autonomous metabolic 
control of immune evasion by tumor cells 
can be further understood. 

The genetics of a long life


Genetically diverse mice and cross-species comparison 


uncover links to longevity 


By João Pedro de Magalhães


Elucidating the genetics of complex
traits and diseases is a major undertaking,
one that can lead to biological
insights and, because genetics increasingly
informs drug discovery (1), new
treatments. Longevity reflects aging
and other processes affecting health and disease,
so unraveling its genetic basis would
have far-reaching implications. However, it 
has been a challenge to identify the genetic 
determinants of longevity in humans or ani- 
mals (2) in part because longevity as a phe- 
notype encompasses multiple other traits 
and disease risks. Tackling the complexity of 
the genetic component of longevity requires 
an integrated approach that leverages multi- 
ple sources of evidence. On page 1508 of this 
issue, Bou Sleiman et al. (3) use genetically 
heterogeneous mice to find new life-span 
loci and further perform cross-species com- 
parisons to better understand the genetic 
basis of mammalian longevity. 
Recent advances in identifying single 
genes that, when genetically manipulated, 
modulate longevity in animal models have
been substantial (4). According to the 
GenAge database (5), there are more than 
2000 longevity-associated genes in model 
organisms including mice, flies, yeast, and 
worms. One underappreciated limitation of 
such studies, however, is that they are mostly 
conducted in inbred, genetically homoge- 
neous animal populations. This means that 
discoveries in the genetics of aging, as well 
as dietary and pharmacological manipula- 
tions, may be strain specific because there 
could be genetic background effects (6). 
Given that most longevity studies in 
mice use isogenic stocks, the role of natu- 
ral genetic variation in longevity in mice 
is not only poorly understood but largely 
unexplored. Bou Sleiman et al. genotyped 
3276 mice from the Interventions Testing 
Program (ITP). The heterogeneous mice 
used in the ITP were derived by a four-way 
intercross of inbred strains. Therefore, one 
limitation of the work of Bou Sleiman et al. 
is that it will miss genetic variants absent 
from inbred strains. Further studies us- 
ing wild-derived mouse populations with 
a larger genetic diversity would thus be of 
great value. Indeed, Bou Sleiman et al. found 
a small number of loci associated with life 
span using the ITP cross of inbred strains,
suggesting that in more diverse mouse populations
there could be many more genetic
determinants of longevity.

Previously unknown loci associated with life span in mice may be relevant to humans living a long life,
such as Emma Morano, who lived to 117 years.

The loci identified by Bou Sleiman et al. 
that modulate life span may contain dozens 
of genes. To further prioritize which genes 
may be causal, Bou Sleiman et al. obtained 
and integrated various other types of data. 
Specifically, they generated gene expression 
profiles of aged mice from the liver, a com- 
monly studied organ in longevity owing to 
its essential functions for maintaining ho- 
meostasis, such as energy metabolism and 
detoxification. Mostly, however, they took 
advantage of multiple publicly available da- 
tasets to gain insights and score the genes 
in each locus that they identified. These 
included differential gene expression from 
the Tabula Muris Senis (7), a single-cell tran- 
scriptomic atlas of mouse aging tissues, and 
longevity genes from mice and orthologs in 
other model organisms from the GenAge da- 
tabase (5). In addition, data on human ortho- 
logs were obtained from the Genotype-Tissue 
Expression (GTEx) project, with tissue- 
specific gene expression including aged tis- 
sues, and from the GWAS Catalog (8), which 
compiles data from genome-wide association 
studies. As such, the study of Bou Sleiman 
et al. showcases the value of these resources 
and the importance of integrating multiple 
types of data to help untangle complex phe- 
notypes, such as life span. 

The comparison of data from multiple or- 
ganisms is also noteworthy. Given that lon- 
gevity is a particularly time-consuming phe- 
notype to study, research has relied on animal 
models, including worms (in which the first 
genes that modulate longevity were discov- 
ered), rodents, and, more recently, dogs (4, 9). 
After scoring and prioritizing the candidate 
genes in the loci modulating life span, Bou 
Sleiman et al. tested several Caenorhabditis 
elegans homologs for life-span effects. They 
found five genes from mice that modulated 
life span in worms when silenced: homeodo- 
main interacting protein kinase 1 (Hipk1); 
dolichyl-di-phosphooligosaccharide-protein 
glycotransferase (Ddost); heparan sulfate 
proteoglycan 2 (Hspg2); FYVE, RhoGEF and 
PH domain containing 6 (Fgd6); and pyru- 
vate dehydrogenase kinase 1 (Pdk1), although 
all except Pdk1 reduced life span and hence
may be pathological rather than aging related.
Hipk1 (hpk-1 in C. elegans) acts as a
regulatory node for chaperones and autophagy
to preserve proteostasis in worms (10), a
major longevity pathway. Furthermore, Ddost
is involved in the processing of advanced glycation
end products that accumulate with
age and have been associated with several
age-related diseases such as diabetes mellitus
and cardiovascular diseases in various
models and humans (11). Interestingly, a locus
near FGD6 has been found to influence
health span, parental life span, and longevity
in humans (12).

In line with other studies of the ITP re- 
vealing that life span is affected by dietary 
manipulations in a sex-specific manner
(13), most loci that Bou Sleiman et al. found
to modulate life span also exhibited sex di- 
chotomy. It is unclear why this is the case. If 
a diet or allele slows aging, then why would 
this not be the case for males and females? 
It is possible that more specific processes or 
diseases (i.e., those with sex-specific effects) 
rather than aging-specific processes per se 
are altered when longevity is modulated. 
Moreover, Bou Sleiman et al. found many 
loci that only modulate life span after a given 
age and partial overlap between life-span 
loci and loci associated with growth or body 
weight, likely reflecting the multiple pro- 
cesses and traits that affect the longevity phe- 
notype. Nonetheless, these sex-specific effects 
emphasize the heterogeneity of longevity and 
the need to study diverse populations (14).

The data generated by Bou Sleiman et al. 
will prove a valuable resource for the study 
of aging and longevity. Many genes are 
thought to contribute to mammalian lon- 
gevity, yet only a (likely small) fraction of 
these are known, and there is still substantial 
unexplained heritability in human longev- 
ity. As such, data sharing and cross-species 
comparisons will foster and enhance future 
longevity studies and lead to more rapid 
progress. Candidates from animal models 
such as worms and mice provide a direction 
for further studies. The question of which 
genes in the loci identified by Bou Sleiman 
et al. contribute to longevity remains unan- 
swered. Likewise, except for FGD6, whether 
the genes identified by the authors to modu- 
late life span in worms are involved in hu- 
man longevity remains to be established. 
Because longevity is a complex, multifac- 
torial phenotype, it will also be important 
to elucidate in the future which processes 
and diseases are affected by genetic vari- 
ants associated with longevity. 

PLANET FORMATION

Earth was seasoned by countless blows

Earth's composition was shaped by collisions from external objects


By Zoé Malka Leinhardt 


The formation of planets around dis-
tant stars is not uncommon. With 
more than 5000 planets detected 
outside of our Solar System, it is esti- 
mated that at least 25% of stars har- 
bor planets. However, astronomers 
have yet to definitively find an Earth-like 
planet with the necessary characteristics 
to support life. Does this apparent rarity 
of Earth-like planets have something to do 
with the way Earth was formed? The his- 
tory of Earth’s formation and composition 
is an ongoing debate. Recent studies have 
found that Earth’s current composition 
does not fully align with what would be ex- 
pected if it was solely made up of the most 
primitive rocky meteorites. On page 1529 
of this issue, Frossard et al. (1) show that 
the chemical makeup of Earth’s crust has 
changed over time as the result of count- 
less impacts with smaller rocky bodies. 

At present, the favored formation model 
for rocky planets, such as Earth, revolves 
around collisions (2). According to this 
view, the process began with a swirling 
disk of dust and gas and a young Sun at 
the center. The dust and gas orbiting the 
Sun originated from a much larger, cooler 
cloud of molecular gas that collapsed 
to form the Sun. Some of the collapsing 
clouds had too much angular momentum 
to fall into the young Sun and instead 
formed a disk going around it. As the disk 
cooled, its solid components accumulated 
into many planetary building blocks, or 
“planetesimals.” The planetesimals col- 
lided with each other often, which resulted 
in accretion and growth, and eventually 
formed rocky planets. 

During this period of planet formation,
the accumulation of an entire planet re- 
quired hundreds of thousands of collisions. 
As larger objects grew, they generated more 
gravitational pull on neighboring objects, 
which caused some collisions to be more vi- 
olent, although even the gentlest collisions 
produced some debris (3). These collisions 
were pivotal in shaping the composition 
of planets and the structure of the Solar 
System. For instance, giant impacts are 
the favored explanation for the formation 
of massive moons, including Earth’s Moon 
(4) and Pluto’s largest satellite Charon (5). 
These collisions also played a role in other 
seemingly more subtle details, for example, 
the spin axial tilt of planets, such as that 
of Uranus (6), whose spin axis lies parallel 
to its orbital plane. So, it is not surprising 
to think that these collisions can also affect 
the composition of the planets. 

The key to understanding how colli- 
sions could change the observed composi- 
tion hinges on a planet’s internal structure. 
Planets, including Earth, are not structur- 
ally uniform in composition. Instead, they 
are differentiated or separated into distinct 
layers. For instance, Earth contains a metal- 
rich iron-nickel core surrounded by a man- 
tle of dense magnesium and iron silicate 
and capped with a floating crust of lighter 
silicate. This differentiation—with heavier 
compounds at the center and lighter ones 
on the outer layers—means that lighter 
material is more susceptible to removal by 
collisional events. This vulnerability is es- 
pecially important for the gentler collisions 
that do not penetrate or expose the denser 
interior layers of the planet (7). 

There is, however, an important nuance 
to this picture. Compositional differences 
found in Earth’s crust and mantle are not 
solely based on the mass density of differ- 
ent elements but are also based on the el- 
ements’ compatibility with each other to 
form molecules. For example, some rare 
earth elements, such as samarium (Sm) and
neodymium (Nd), are incompatible with 
mantle minerals and are thus “pushed” up 
to the crust despite being relatively heavy 
elements. Because of this detail, the radioactive
decay of the isotope ¹⁴⁶Sm into ¹⁴²Nd
has been used in geochemistry to study
crust and mantle evolution. However, Earth
has an excess of ¹⁴²Nd (relative to ¹⁴⁴Nd)
compared with the meteorites that are con- 
sidered representative of the planetesimals 
that built Earth (8). 

There are three competing explanations 
for the apparent excess of ¹⁴²Nd. One idea
suggests that the excess is only “apparent”
and more ¹⁴⁴Nd exists deep within the
mantle but is beyond the reach of current
sampling techniques. Another idea is that
the planetesimals that built Earth were not
the same in composition because the accretion
disk around the Sun was not uniform;
it may actually be the planetesimals that are
poor in ¹⁴²Nd. Another possibility, one supported
by the findings of Frossard et al., is
that the crust of Earth was continuously 
eroded by impacts during its formation, 
which caused a greater loss of elements 
preferentially found in the crust. 

Although both Sm and Nd are incom- 
patible with the mantle, this discordance 
is slightly more pronounced with Nd. 
Therefore, as Earth formed, the ratio of 
Nd in the crust compared with Nd in the 
mantle is higher than the same ratio for Sm. 
Because of this, a larger fraction of ¹⁴⁴Nd
than ¹⁴⁶Sm in Earth was removed from the
crust through collisions. The ¹⁴⁶Sm left in
the crust then eventually decayed into ¹⁴²Nd
and produced the excess of ¹⁴²Nd over ¹⁴⁴Nd.
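
The preferential-loss argument above can be sketched as a toy mass balance. Every number below is hypothetical and chosen only to respect the one constraint stated in the text, namely that Nd is concentrated in the crust more strongly than Sm; the function name and parameters are likewise illustrative, not taken from Frossard et al.

```python
def sm_nd_ratio_after_erosion(
    sm_crust_fraction: float,
    nd_crust_fraction: float,
    crust_removed: float,
    initial_sm_nd: float = 1.0,
) -> float:
    """Return the bulk Sm/Nd ratio left after part of the crust is stripped.

    sm_crust_fraction / nd_crust_fraction: share of each element's total
    inventory that resides in the crust (hypothetical values).
    crust_removed: fraction of the crust lost to impacts (0 to 1).
    initial_sm_nd: starting bulk Sm/Nd ratio, normalized to 1 by default.
    """
    sm_left = 1.0 - sm_crust_fraction * crust_removed
    nd_left = 1.0 - nd_crust_fraction * crust_removed
    return initial_sm_nd * sm_left / nd_left


# Hypothetical partitioning: Nd is more crust-loving than Sm.
no_erosion = sm_nd_ratio_after_erosion(0.30, 0.40, crust_removed=0.0)
with_erosion = sm_nd_ratio_after_erosion(0.30, 0.40, crust_removed=0.5)

print(f"Normalized Sm/Nd without erosion: {no_erosion:.4f}")
print(f"Normalized Sm/Nd with erosion:    {with_erosion:.4f}")
```

In this sketch, erosion raises the bulk Sm/Nd ratio because proportionally more Nd than Sm leaves with the crust. A higher Sm/Nd ratio means more radiogenic Nd is later produced per unit of nonradiogenic Nd, which is the direction of the excess described in the text.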

Frossard et al. provide evidence that the
chemical composition of Earth was shaped
by countless collisions, during which
elements that were more prevalent on the
outer crust were repeatedly chipped away.

Frossard et al. argue that there is little
evidence to support the hidden ¹⁴⁴Nd theory.
They claim that because of the deep
convective nature of the mantle during the
final stages of Earth’s accretion, ¹⁴⁶Sm and
¹⁴⁴Nd would have been too well mixed to
completely explain the excess of ¹⁴²Nd. To
determine the amount of compositional
variation of Earth’s building blocks, they 
used refined mass spectroscopy to analyze 
meteorites thought to be representative 
of the planetesimals that built Earth (9). 
According to the measurements, to explain 
the ¹⁴²Nd excess in the crust, up to 20% of
Earth could have been removed by collision 
events during its formation. The authors 
point out that this collisional erosion would 
also remove other rare earth elements that 
tend to accumulate in the crust. This would 
provide an explanation for other mysteries, 
such as the apparent lack of uranium, po- 
tassium, and thorium in Earth’s crust. 
There are clues that Earth is not alone 
in suffering collisional compositional modi- 
fication. For instance, within the Solar 
System, a single large collision is, at pres- 
ent, the favored explanation for the lack of a 
crust and mantle on Mercury (10). Looking
beyond the Solar System, giant impacts may 
also be responsible for the high mass den- 
sity of some exoplanets, such as GJ 376 (11) 
and Kepler-107c (12). Although the findings
of Frossard et al. do not directly explain why
Earth is perhaps distinctive in its ability to
support life, they highlight the rich diversity
among rocky planets and how little is
known about the events that shape them. 

A framework for sex, gender, and diversity analysis in research


Funding agencies have ample room to improve their policies 


By Lilian Hunt, Mathias Wullum Nielsen,
Londa Schiebinger


National research agencies are responsible
for promoting excellent
research that benefits all of society
(1). Integrating sex, gender, and diversity
analysis (SG&DA) into the
design of research, where relevant,
can improve research methodology, en- 
hance excellence in science, and make 
research more responsive to social needs 
(2). National funding agencies—encour- 
aged by scientists and social movements— 
have thus begun to implement policies to 
integrate sex, gender, and, more recently, 
diversity analysis into the grant proposal 
process, where these factors have been 
shown to play a role. We develop a five-part 
analytical framework for implementing 
and evaluating SG&DA policies, and use 
it to evaluate the quality of SG&DA poli- 
cies for 22 major national funding agencies 
across six continents. By collecting emerg- 
ing global practices for policy implemen- 
tation, we seek to improve understanding 
of these policies and practices in efforts to 
enhance international collaborations and 
research excellence. 

SG&DA is highly developed in health 
and biomedicine, machine learning, and 
artificial intelligence, and is emerging in 
other fields. Incorporating SG&DA into 
research design has enabled advance- 
ments across numerous disciplines. For 
example, research on pain documents both 
biological aspects (i.e., sex differences in 
electrical, ischemic, thermal, pressure, 
and muscle pain sensitivity) and cultural 
aspects (differences in how people report 
pain and how physicians understand and 
treat pain). Consensus analyses now in- 
clude sex-based analysis in nonhuman
animal and human preclinical and clinical 
research, gender-based analysis in patient- 
physician relationships and, more recently, 
analysis of different racialized groups and 
ethnicities in clinical treatment (3, 4). 

SG&DA informs each phase of the re- 
search process—from establishing project 
objectives (e.g., considering the character- 
istics of target populations and the social 
implications of the project), to developing 
methodologies (e.g., ensuring appropriate 
and unbiased measures and instruments), 
gathering data (e.g., sampling sufficient 
participant numbers across categories), 
analyzing data (e.g., considering within- 
and between-group differences and inter- 
secting factors) to reporting results (e.g., 
considering language use and specifying 
how categorical data were collected and 
annotated) (5). 

There are three pillars of the science in- 
frastructure that need to coordinate poli- 
cies to achieve excellence in science (2). 
The first, funding agencies, encourage in- 
tegrating SG&DA at the beginning of the 
research process. In 2003, the European 
Commission (EC) endorsed “questioning 
systematically whether, and in what sense, 
sex and gender are relevant in the objec- 
tives and methodology of projects.” Other 
public-funding agencies followed suit with 
policies implemented at the Canadian 
Institutes of Health Research (CIHR; 
2010), German Research Foundation (DFG; 
2020), and National Research Foundation 
of Korea (NRF; 2021), among others (6). 

Pillar two, peer-reviewed journals, in- 
creasingly consider SG&DA when selecting 
manuscripts for publication (7). Pillar three, 
universities and research institutions, are 
responsible for developing methods for 
this type of analysis and for providing this 
expertise to future generations. Many fac- 
ulties of science, medicine, and engineer- 
ing have not yet integrated knowledge of 
SG&DA into their core curricula; hence, 
many researchers and research evaluators 
lack training in these types of analyses, and 
new guidance is needed to improve this. 


This study focuses on funding agencies 
and develops an analytical framework to 
evaluate the uptake of policies for inte- 
grating sex, gender, and diversity—which 
covers intersectional characteristics such 
as age or life course, indigeneity, race and 
ethnicity, sexuality, socioeconomic status, 
and other axes of inequality—into research 
design. Previous studies have analyzed 
single funding agencies in depth (1, 8-9) 
or focused narrowly on particular regions 
such as sub-Saharan Africa, or more often 
Europe and North America (10, 11). One
international study included questions on 
SG&DA policies in their larger survey fo- 
cused on gender equity in research teams 
(12). Ours is the first to develop a cross-dis- 
ciplinary analytical framework for policy 
development and to apply it globally. 


DEVELOPING THE POLICY FRAMEWORK 

We conducted an investigation in three 
stages: (i) document analysis, (ii) global 
survey, and (iii) policy analysis. Based on 
a documented analysis of SG&DA-related 
policies and guidelines and prior research 
(6), we developed our analytical frame- 
work, a five-part guide for implementing 
and evaluating SG&DA policy (see the box) 
(for a full outline of the development pro- 
cess, see supplementary materials (SM)
S1). We convened an international advi-
sory group that included representatives 
from public funders, expert researchers, 
and policy specialists from five global ar- 
eas—Africa and the Middle East, Europe 
and Central Asia, Latin America and the 
Caribbean, North America, and South and 
East Asia and the Pacific—to discuss and 
improve the framework’s clarity, specific- 
ity, applicability, and global representative- 
ness (SM S2).

We tested the applicability of our 
framework for evaluating SG&DA policies 
through a pilot survey targeting six large 
public funding agencies in Europe, East 
Asia, and North America—the CIHR, DFG, 
EC, Irish Research Council, US National 
Institutes of Health (NIH), and NRF 
Korea—four of which had representation 
on our advisory board. The survey was de- 
signed to collect all relevant information 
on SG&DA-related activities done by the 
agencies. Evidence consisted of publicly 
available policies and guidance documents, 
and, where agencies were in the process 
of developing policy, internal documents. 
We checked all self-reported responses 
against agency documents. The pilot study 
confirmed the applicability of our evalua- 
tion tool and also led to minor revisions in 
item phrasing. 

For the global evaluation, we consulted 
with our advisory board to select 39 ad- 
ditional funding agencies based on geo-
graphic spread with the goal of covering 
agencies from all continents that host such 
agencies (SM S4). We focused on public 
agencies as these are typically the larg- 
est, most prestigious, and agenda-setting 
funders at the national level. Of the 39 in- 
vited, 16 agreed, yielding a final sample of 
22 (SM S5). 

Response rates differed across regions. 
In North America, all 5 invited agencies 
participated; in Europe and Central Asia, 9 
of 14 agencies responded; in Latin America 
and the Caribbean, 2 of 3 agencies re- 
sponded; in South and East Asia and the 
Pacific, 4 of 7 agencies responded; and in 
Africa and the Middle East, only 3 of the 11 
invited responded (table S1). 
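
The regional figures above can be turned into response rates with a short calculation. This is a minimal sketch that uses only the invited and responding counts quoted in the paragraph; the variable names are illustrative.

```python
# (invited, responded) per region, as quoted in the paragraph above.
participation = {
    "North America": (5, 5),
    "Europe and Central Asia": (14, 9),
    "Latin America and the Caribbean": (3, 2),
    "South and East Asia and the Pacific": (7, 4),
    "Africa and the Middle East": (11, 3),
}

# Response rate = responding agencies / invited agencies.
response_rates = {
    region: responded / invited
    for region, (invited, responded) in participation.items()
}

# List regions from highest to lowest response rate.
ranked = sorted(response_rates, key=response_rates.get, reverse=True)

for region in ranked:
    print(f"{region}: {response_rates[region]:.0%}")
```

Ranking the regions this way makes the spread explicit, from full participation in North America down to roughly a quarter of invited agencies in Africa and the Middle East.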

As part of our participatory design, 
funders were invited to complete an online 
questionnaire (SM S8 to S11) and asked to 
provide evidence (either publicly available 
or internally agreed upon) for each answer. 
Each agency was scored on their perfor- 
mance across the five parts of the policy 
framework by at least two evaluators using 
a scoring matrix (SM S6 and S12) vetted by 
the advisory board. The scores of individ- 
ual funding agencies are kept confidential 
per agreement. 

Overall, one agency scored in the first 
tier (81+ points out of 100) in SG&DA poli- 
cies and measures; six agencies scored in 
the second tier (61 to 80 points), five in 
the third tier (41 to 60 points), two in the 
fourth tier (21 to 40 points), and nine are 
just beginning (0 to 20 points) (see the 
figure). Globally, almost half of agencies 
(9 of 22) provided definitions of terms of 
reasonable quality (first and second tiers) 
(see the figure) (table S2). Similarly, almost
half of agencies (9 of 22) provided quality 
proposal guidelines for applicants. Only 
about a third of agencies (7 of 22) provided 
quality instructions to evaluators, and
even fewer (5 of 22) provided quality train- 
ings for applicants, evaluators, and staff. 
Evaluation of policy implementation was 
the weakest, with only two agencies scor- 
ing in the second tier and none in the first. 
Many agencies have begun implementing 
policy, but most have not considered evalu- 
ation. It is important that agencies plan to 
evaluate policies from the very beginning. 
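
The tier boundaries described above amount to a simple lookup from a score to a tier. The sketch below shows one way to express that mapping in Python. It assumes whole-number scores on a 0 to 100 scale, which the text implies but does not state; the function name and the example scores are hypothetical, since individual agency scores are confidential.

```python
def tier_for_score(score: float) -> int:
    """Map a score on a 0-100 scale to a tier, where 1 is highest and 5 lowest.

    Boundaries follow the text: 81+ is tier 1, 61-80 tier 2, 41-60 tier 3,
    21-40 tier 4, and 0-20 tier 5. Whole-number scores are assumed; a
    fractional score such as 80.5 would fall into the lower tier here.
    """
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    if score >= 81:
        return 1
    if score >= 61:
        return 2
    if score >= 41:
        return 3
    if score >= 21:
        return 4
    return 5


# Hypothetical scores, for illustration only.
example_scores = [85, 61, 40, 20]
example_tiers = [tier_for_score(s) for s in example_scores]
print(example_tiers)
```

The same function could be applied to each of the five component scores after converting them to a percentage of the points available for that component.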

One complicating factor in comparing 
agencies is that some agencies span all fields 
of the human and natural sciences, technol- 
ogy, and health and biomedicine, whereas 
others are more specialized. When we di- 
vided our data by agency type, however, we 
found no pronounced differences, suggest- 
ing that funders with wide remits can suc- 
cessfully implement these policies (fig. S1). 

Each component of our five-part framework
process for policy development is important
to the success of policy implementation.
A simple policy mandate to include
SG&DA in research is itself not enough,
and, when poorly executed, SG&DA
can lead to harm (13). Further, funding
agency policies are but one part of behavioral
and cultural shifts in the research
endeavor and need to be supported by coordinated
change across the broader science
infrastructure.

A framework to implement and evaluate policies

The framework covers five aspects of public
funding agencies’ efforts to promote sex,
gender, and diversity analysis (SG&DA).

Definition of terms
• Clear and quality definitions
• Definitions readily available

Proposal guidelines for applicants
• Instructions to applicants to include SG&DA
• Encourage or require?
• Examples given
• Specify how SG&DA is included at each
  stage of the research cycle: detail for yes
  and justify for no

Instructions for evaluators
• Instructions for reviewers to include SG&DA
  in their evaluations
• Assessment at each stage
  of the research process
• Monitoring

Trainings for applicants, evaluators, and staff
• Training, resources, and support available
  for applicants
• Training, resources, and support available
  for proposal evaluators
• Training, resources, and support available
  for relevant agency staff
• Training mandatory through certification
• Development of open access resources:
  courses and high-quality materials

Evaluation of policy implementation
• Number and proportion of proposals that
  include SG&DA
• Number and proportion of proposals that
  include quality SG&DA
• Quality of evaluators’ scoring & comments
• Number of applicants, evaluators, & staff
  who engaged in training
• Number and proportion of publications
  from funded proposals that include SG&DA


EMERGING GLOBAL PRACTICES 

Each agency will develop country-specific 
policies that accommodate their cultural 
practices and regulatory landscapes. At the 
same time, agencies can share policies and
practices to enhance research collaboration
across regions. Here we explore key consid- 
erations for each part of our policy frame- 
work and track emerging policies. These 
considerations are drawn from agencies’ 
responses to our questionnaire (SM S7).

Overall, we found differences across coun- 
tries as to where these policies sit within 
agencies, how they are implemented, and 
where accountability for them lies. Most 
funders in our study—for example, the DFG 
and EC—include SG&DA under their “excel- 
lence” criteria; i.e., SG&DA is judged in rela- 
tion to its potential to create new knowledge. 
CIHR, for example, found that once their 
SG&DA policy was established, applicants 
who scored well on the SG&DA question 
scored well overall, i.e., SG&DA improved the 
overall quality of the proposal, and the pro- 
posal was more likely to be funded (9). 


Definition of terms 
The first step in policy development is defi- 
nition of terms (see the box). It is important 
that the same definitions are shared with ap- 
plicants, evaluators, and staff to support con- 
sistency across the agency. For example, the 
Canadian Tri-Agency provides guidance in 
both English and French describing Gender- 
Based Analysis Plus (GBA+) that links across 
agency materials (all references to policy 
documents are available in table S3). Which 
terms and how they are defined will be coun- 
try specific. For “diversity,” for example, each 
agency will determine which factors to pri- 
oritize and for which funding areas. Some 
agencies, such as those in Australia, synchro- 
nize their guidelines with those set through 
national legislation. [See (14) for example
definitions of key terms, and links between
agency policies and this five-part framework.]
We chose to evaluate agencies on “sex” 
and “gender” analysis because sex and 
gender were historically the categories in- 
cluded in agency policy—for example, by 
the EC. Although sex is a biological charac- 
teristic of humans and numerous nonhu- 
man organisms and is an important cat- 
egory to continue to call out, gender is at 
the same epistemic level as other aspects 
of sociocultural diversity and may, in the 
future, be folded into diversity analysis. 
We included both gender and diversity to 
allow us to evaluate developed and emerg- 
ing policy practices. Second, we considered 
evaluating on “intersectional” analysis but, 
after policy reviews and consultation with 
our advisory board, judged “diversity” a 
more generally used term. 


Proposal guidelines for applicants 

Agencies take three basic approaches in 
their request to applicants to integrate 
SG&DA into their proposal, where relevant: 
Most encourage applicants to integrate
SG&DA; a few require this type of analy- 
sis; some only encourage applicants but 
instruct evaluators to score this element. 
In this study, more points were awarded 
to funders who required SG&DA; how- 
ever, more research is needed to deter- 
mine the differential impact of these vari- 
ous approaches. In all cases, the “where 
relevant” is crucial. No agency asks for 
SG&DA in pure mathematics, for example, 
where no body of literature has established 
its relevance. 

The trajectory of the EC is of interest. 
Since 2003, the Commission has encour- 
aged sex and gender analysis, referred to 
as the “gender dimension,” in research. 
To strengthen the policy, in 2014, Horizon 
2020 flagged topics for which taking the 
gender dimension into account was man- 
datory. Since 2021, Horizon Europe has 
required all proposals to consider sex, 
gender, and/or intersectional analysis in 
research, unless otherwise specified. 

The US NIH has, since 2016, required 
all applicants to consider “sex as a biologi- 
cal variable” (SABV) and has detailed how 
this type of analysis supports excellence 
in science (1). This requirement supple- 
ments policies for inclusion of sex and 
gender, race, and ethnicity in clinical tri- 
als launched in the 1990s and inclusion of 
age (Inclusion Across the Lifespan) added 
in 2019. 

The DFG implemented its SG&DA guide- 
lines in 2020 after a 2-year study period. 
The DFG encourages but does not require 
applicants to consider SG&DA, emphasiz- 
ing that it funds “proposals in curiosity- 
driven basic research” in fields selected 
by applicants where freedom of research 
is core. In so doing, the DFG taps into the 
issue of academic freedom, where princi- 
pal investigators set their own research 
agendas. Notably, SG&DA is one method- 
ological tool among many that researchers 
may employ. Like any other methodology, 
appropriate application is key to research 
outcomes. It is the job of researchers and 
research evaluators to determine when 
SG&DA may enhance research outcomes. 
Interestingly, the DFG requires research 
evaluators to take SG&DA into account, 
making SG&DA analysis de facto required 
if relevant to research outcomes. 

In many countries, agency policy is gov- 
erned by national legislation. In Japan, 
where the government renews the ba- 
sic plans for Science, Technology, and 
Innovation (STI) every 5 years, the sixth 
STI Basic Plan in 2021 included integrating 
the gender perspective and gender analysis 
into research and technology development. 
Similarly, in 2021, the Republic of Korea
passed an amendment of the Framework
Act on Science and Technology to include
integrating sex and gender into research,
which allowed the NRF Korea to develop
new policies.

Agencies’ performance on the policy framework

Using a scoring matrix (see supplementary materials), funding agencies were awarded points reflective of
their performance on each of five parts of the sex, gender, and diversity analysis (SG&DA) policy framework
(bottom five bars). Those five component-level scores were aggregated into an overall score (top bar). For each
component and the aggregate, agencies were assigned to five tiers based on the percentage of the total possible
points that they received for that component or aggregate. Bars reflect the number of agencies (out of 22
in total) whose scores fall within a given tier for that component or aggregate.

[Figure: bar chart. Axis: number of funders scoring in each tier. Tiers: 1st tier (81-100%), 2nd tier (61-80%),
3rd tier (41-60%), 4th tier (21-40%), 5th tier (0-20%). Bars: overall score; Definition of terms; Proposal
guidelines for applicants; Instructions for evaluators; Trainings for applicants, evaluators, and staff;
Evaluation of policy implementation.]

In our study, we evaluated whether appli- 
cants are instructed to detail how SG&DA 
analysis is incorporated into all phases of 
research. If SG&DA is not relevant to the 
proposed research, applicants should be 
asked to provide literature to demonstrate 
that no sex, gender, or relevant diversity 
differences have been found. 


Instructions for evaluators 

Evaluators are crucial to the success of 
SG&DA policies. CIHR found that “target- 
ing applicants alone to adopt new science 
policies without concomitant pressure 
by evaluators...may not be effective” (9). 
Since 2018, CIHR has required evalua- 
tors to rate the quality of the SG&DA as a 
“strength,” “weakness,” or “not applicable” 
and to provide a rationale for their rating 
along with recommendations to applicants 
for improvement. 

Funders should provide applicants and 
evaluators similar forms and instructions 
for consistency across the research process. 
Some agencies, such as the EC, are limited 
in the overall instructions they can pro- 
vide on this particular requirement given 
the number of topics that need to be cov- 
ered. Agencies may provide “good research 
guides” that reference assessing SG&DA 
alongside other elements of peer review, 
such as ethics and reproducibility. 

Agencies must monitor the evaluation
process to confirm that SG&DA is addressed
in reviewer comments and that those com- 
ments are high quality. 


Trainings for applicants, evaluators, 

and staff 

SG&DA is not yet consistently part of uni- 
versity curricula in the physical and life 
sciences, health and biomedicine, and 
engineering. Until universities step up 
to the task, funding agencies need to fill 
this gap. The most comprehensive online, 
interactive agency trainings to date are in 
health and biomedicine. Trainings are also 
available for certain areas of the natural 
sciences, engineering, computer science, 
and environmental sciences, but more are 
needed. Agencies can evaluate the effec- 
tiveness of the training by including a pre- 
test and a post-test. CIHR, for example, 
found that the majority of participants who 
completed training modules demonstrated 
improved knowledge of sex and gender 
analysis (15). Most trainings are voluntary; 
however, some funders require applicants 
to submit a certificate of completion for 
large, strategic competitions. 

Use of the same training materials by ap- 
plicants, evaluators, and agency staff helps 
to ensure consistency in policies, terminol- 
ogy, and expectations. Agencies can coor- 
dinate and share trainings internationally; 
there is no need to duplicate efforts, except 
where specific cultural practices require a 
particular approach. 

Some agencies foster training in this area 
through research institutions. The US NIH, 
for example, has invested $160 million in
Specialized Centers of Research Excellence
across 25 research institutions to “train
researchers in experimental design and analyses
that consider sex and/or gender” (1).

GRAPHIC: N. CARY/SCIENCE


Evaluation of policy implementation 

Only three agencies in our study had per- 
formed policy implementation evaluations. 
A further nine were in the planning stages; 
10 had no plans in place. We strongly rec- 
ommend that agencies implement evalu- 
ation plans as they develop policies to 
facilitate appropriate quantitative and 
qualitative evaluation. 

Our framework sets out five aspects for 
the evaluation of policy implementation: 

1) The number and proportion of pro- 
posals that include SG&DA. CIHR found 
that from 2011 to 2019, the proportion of 
grants including sex analysis increased 
from 22 to 83% and grants including gen- 
der analysis from 12 to 33%. The level of 
integration differed across sectors, with 
the lowest in biomedical and the highest 
in clinical research (8). 

2) The number and proportion of pro- 
posals that include quality SG&DA. The 
EC conducted a mid-term evaluation of 
sex and gender analysis for Horizon 2020 
in 2017 to consider the proposal qual- 
ity, methods, impacts, dissemination, and 
also whether the project advanced meth- 
odology for sex and gender analysis in 
the particular field in which the proposal 
was submitted. Overall, the EC concluded 
that the quality of the gender dimension 
in project proposals was not high and 
that more training was needed (9). The 
EC also experimented with computer- 
assisted textual analysis given the volume 
of applications per year. These evaluation 
methods are in their infancy and require 
further development. 

3) The quality of evaluators’ scoring 
and comments. This aspect emerged as a 
central point in the analysis of agencies’ 
self-reported activities. We did not score 
funders on this point but will in future 
iterations. CIHR manually sampled 5% of 
evaluators’ comments to check the quality 
of responses (9). The EC reviewed the ef- 
fectiveness of review panels and found that 
only 36% considered the gender dimension 
and, of those, 70% included a gender ex- 
pert, suggesting that review panels require 
guidance from experts (8). 

4) The number of applicants, evalua- 
tors, and staff who engaged in trainings 
and in what type of training. If possible, 
the correlation between applicant train- 
ing and the success of applicant proposals 
should be assessed. Some funders, such 
as the Spanish Carlos III Health Institute, 
reported in our questionnaire that they
monitor the number of applicants who
participate in SG&DA training and are set- 
ting targets for improvement. 

5) The number and proportion of peer- 
reviewed publications (or other recog- 
nized modes of dissemination) that result 
from funded proposals that incorporated 
SG&DA. To monitor this, funders will need 
to track papers and research outputs using 
grant numbers. The Science Foundation 
Ireland reported in our study that they col- 
lect researcher-reported publication data 
to check that proposals that included sex 
and/or gender analysis reported those re- 
sults in publications. 
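
The fifth aspect depends on linking funded proposals to their publications through grant numbers. The Python sketch below illustrates that linkage under stated assumptions: the record types, field names, grant numbers, and sample data are all hypothetical, and no agency's actual tracking system is described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Proposal:
    """A funded proposal (hypothetical record layout)."""
    grant_number: str
    planned_sgda: bool  # proposal stated that SG&DA would be carried out


@dataclass(frozen=True)
class Publication:
    """A publication acknowledging a grant (hypothetical record layout)."""
    grant_number: str
    reports_sgda: bool  # publication reports SG&DA results


def sgda_follow_through(proposals, publications):
    """Return (count, proportion) of SG&DA proposals that led to at least
    one publication reporting SG&DA, matched on grant number."""
    reporting_grants = {p.grant_number for p in publications if p.reports_sgda}
    planned = [p for p in proposals if p.planned_sgda]
    if not planned:
        return 0, 0.0
    followed = sum(1 for p in planned if p.grant_number in reporting_grants)
    return followed, followed / len(planned)


# Invented sample data, for illustration only.
proposals = [
    Proposal("G-001", True),
    Proposal("G-002", True),
    Proposal("G-003", False),
]
publications = [
    Publication("G-001", True),
    Publication("G-002", False),
    Publication("G-003", False),
]

count, proportion = sgda_follow_through(proposals, publications)
print(count, proportion)
```

A real implementation would also need to handle grants with no publications yet and publications that acknowledge several grants, which this sketch treats only in the simplest way.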

A trend that we continue to watch is the 
broadening of sex and gender analysis to 
include other social dimensions. Already, 
the EC has added “intersectional” analysis 
to their gender dimension; these policies, 
however, remain under the broader gender 
equality strategy. The DFG started with 
sex, gender, and “diversity” on equal terms. 
The US NIH has included “age,” which 
they term “Inclusion Across the Lifespan.” 
A number of funding agencies, such as 
the Natural Sciences and Engineering 
Research Council of Canada, incorporate 
research design policies under a broader 
equity, diversity, and inclusion (EDI) um- 
brella. In the past, EDI has typically fo- 
cused on “who” is doing the research, not 
on “how” research is done. This means 
that special care will be needed to expand 
EDI to include research methodologies. 


FUTURE FRAMEWORK DEVELOPMENT 

Our evaluation of SG&DA policy implemen- 
tation has some limitations that restrict its 
generalizability. First, our coverage of pub- 
lic funding agencies, though geographi- 
cally comprehensive, is not exhaustive. 
Participation rates were especially low for 
agencies in Africa and the Middle East, 
and we were unable to establish contacts 
with funding agencies in Russia and China. 
Second, we excluded private funders in this 
first round. Future studies might consider 
large international private funders, such 
as the Bill and Melinda Gates Foundation, 
Novo Nordisk Foundation, Oswaldo Cruz 
Foundation, and Wellcome Trust. Third, 
we might add elements to each of the five 
parts of our framework. For example, are 
applications that include SG&DA funded 
at rates and amounts comparable to those 
of other similar projects? Fourth, poli- 
cies that encourage scientists to change 
their practices can spur resistance. Future 
frameworks might include assessing how 
funding agencies deal with such resis- 
tance. Finally, our study details how well 
and how consistently SG&DA policies are 
implemented, but further quantitative and 


qualitative analysis is needed to correlate 
the framework’s scoring with the impact 
of agency policies on research outputs. 
Current funder evaluation policies do not 
support this type of analysis. 


CONCLUSION 

National public funding agencies seek differ- 
ent ways to support the progress of science 
and technology and advance health, pros- 
perity, and well-being by promoting discov- 
ery and innovation, excellence, and societal 
relevance in research. SG&DA is one set of 
methods among many that researchers will 
deploy to help achieve these goals. Funding 
agencies are rapidly developing policies to 
promote SG&DA. The goal of our work is to 
assist in collaboratively achieving efficient 
implementations of SG&DA-related funding 
policies globally. 

Revisiting the first dinosaur park


A once-ridiculed sculpture display is worth a second look 


By Ilja Nieuwland 


The reviewer is at the Huygens Institute, Royal Netherlands
Academy of Arts and Sciences, Amsterdam, Netherlands.
Email: ilja.nieuwland@huygens.knaw.nl

The Art and Science of the Crystal Palace Dinosaurs
Mark P. Witton and Ellinor Michel
The Crowood Press, 2022. 192 pp.

The Crystal Palace Dinosaurs, a series of life-size statues that
rendered extinct prehistoric species three-dimensionally for the
first time, made their public debut in 1854 on the grounds of the
Crystal Palace in the London suburb of Sydenham. Despite its
moniker, the 30-piece collection includes only four dinosaurs:
two iguanodons, a megalosaurus, and a hylaeosaurus. Based on
fragmentary fossil material, the models were later revealed to be
wildly inaccurate, and the installation was widely ridiculed in
the following decades. In recent years, however, the statues have
experienced a reappraisal as historically important objects.

The story of the Crystal Palace Dinosaurs has been told,
embellished, and retold to the point where mythology and facts
start to merge. A new book, The Art and Science of the Crystal
Palace Dinosaurs, seeks to set the record straight on these
important reconstructions. Written by Mark Witton, a
paleontologist and paleoartist at the University of Portsmouth,
and Ellinor Michel, a conservator at London’s Natural History
Museum, it succeeds in compiling various narratives about the
collection and critically comparing them with archival
documentary evidence and physical analyses of the sculptures.

One oft-repeated myth surrounding the collection concerns the
alleged close collaboration between Benjamin Waterhouse Hawkins,
the sculptor who created the statues, and paleontologist Richard
Owen, who was hired as a professional consultant on the project.
As Witton and Michel demonstrate, Owen showed only very
occasional interest in the project, and Waterhouse Hawkins should
be considered the main, if not the sole, creator of the statues.

The book also examines other aspects of the exhibition, including
its construction and preservation. The authors purposely
de-emphasize the dinosaur sculptures, by far the most frequently
discussed of the collection, a strategy likely meant to prevent
the more famous denizens of the park from overwhelming the others.

Witton’s interest in the Crystal Palace Di- 
nosaurs is not coincidental; as a paleoartist, 
he is part of a movement that seeks to pro- 
mote “greater originality and bolder specu- 
lation in paleoartworks” (7). As such, he is 
sympathetic to Waterhouse Hawkins’s plight, 
noting that the artist had to extrapolate from 
fragmentary material and limited data. 


Visitors take in the Crystal Palace Dinosaurs,
the first 3D models of prehistoric creatures.


Each of the installation’s animal statues 
is subjected to an analysis of its conception, 
construction, reception, and preservation. 
The statues are also considered against 
contemporary reconstructions rendered by 
Witton. This serves to emphasize how fun- 
damentally reconstructions have changed 
in some cases, such as the dinosaurs iguan- 
odon and megalosaurus (chapter 7). In 
other cases, the differences are more super- 
ficial (e.g., Megatherium, chapter 4). 

Although the book discusses the public 
reception of the exhibition, it tends to un- 
dersell the staging of the exhibits and its 
influence on the public. What it gets right, 
however, is its discussion of the connections 
between modern popular paleontology and 
that of the late 19th century. A prime exam- 
ple is the exhibition’s accompanying mer- 
chandise, which included scale models of 
extinct animals. This practice is now ubiq- 
uitous in modern natural history museums. 

The book’s final chapter, which dis- 
cusses the preservation of the Crystal Palace 
Dinosaurs, is perhaps the most urgent. Up 
until the end of the 20th century, the authors 
note, the models were subjected to “relatively
callous treatment,” resulting in substantial
damage, and by the time the collection
achieved heritage status in 1973, seven
statues had vanished. Meanwhile, the con- 
dition of the remaining statues continues to 
deteriorate as a result of vandalism, weeds, 
moisture, and a general lack of preservation. 
The authors argue passionately for greater 
investment in this consequential collection, 
pledging to donate the proceeds of this book 
to a charity dedicated to their conservation 
and interpretation.

Spatial thinkers receive their due


An insider’s guide to visual reasoning celebrates 
those who think in pictures and patterns 


By Jonathan Wai 


How we come to see the world can
sometimes prevent us from seeing it
in other ways. This idea is at the heart
of Temple Grandin’s new book, Visual
Thinking, in which she explains how
she sees the world. Leading readers
through a dizzying array of scientific and 
other literature, she argues that we are not 
identifying and developing the talents of 
those who are visual thinkers (“people who 
think in pictures and patterns”), giving pref- 
erence instead to verbal thinkers (“people 
who think in words”), and this has conse- 
quences for many areas of society. This idea 
is supported by a long history of research 
on spatial reasoning and spatial thinking
(1-4). She notes that her inspiration for
Visual Thinking came from Thomas West’s 
influential 1990 treatise In the Mind’s Eye, 
which profiled some of the most notable vi- 
sual thinkers in history and connected visual 
thinking with neurodiversity (5). 

Grandin discusses a wide array of visual 
thinking engineers and inventors, including 
famous examples such as Thomas Edison 
(a lifelong hero of hers), as well as builders 
and innovators she has met in her work. 
She writes, for example, about a man with 
dyslexia who reports that he was saved by a 
welding class in high school. He now owns a 
metal fabrication company, has multiple pat- 
ents, and can build just about anything with- 
out needing to sketch out designs in advance. 

A core theme that threads the book’s 
chapters is how diverse perspectives are 
useful for scientific and societal problem- 
solving and how people with different tal- 
ents are often necessary for seeing a prob- 
lem’s various facets. Here, she describes 
the accomplishments of American inven- 
tors Russell and Sigurd Varian. Russell’s 
theoretical and technical knowledge
complemented Sigurd’s mechanical skill, 
allowing the brothers to pioneer micro- 
wave and radiation therapy devices. The 
key, she argues, is to ensure that there is 
mutual respect for different viewpoints, 
which improves productive communica- 
tion in teams. 


The reviewer is at the Department of Education Reform 
and Department of Psychology, University of Arkansas, 
Fayetteville, AR 72701, USA. Email: jwai@uark.edu 
The book also reviews research on genius
and neurodiversity and offers an
update on how Grandin’s thinking has 
evolved regarding the importance of un- 
derstanding and appreciating animal con- 
sciousness. She argues here that animal 
thinking and sensing are largely visual and 
that this shared perspective helps her forge 
deep connections with them. 

In chapter 6, Grandin provides an incredible
look into how visual thinkers, and specifically
those she calls “object thinkers” (individuals
like herself who conceive of the world
through photorealistic images), are able to
visualize systems and troubleshoot an array 
of problems, ranging from those that emerge 
on mechanical assembly lines and during 
plane flight disasters to those that arise from 
decaying building infrastructure and precede 
bridge collapse. Here she describes how the 
James Webb telescope was delayed, in part, 
because of poor workmanship. When sub- 
jected to a shake test, bolts and fasteners fell 
off. She argues that a good object visualizer 
could have predicted this and designed more 
appropriate fasteners. 

Building on Grandin’s rich examples and 
emphasis on helping visual thinkers, espe- 
cially those who are neurodiverse, it is impor- 
tant to consider ways we might help spatially 
talented students in K-12 schools, where 
roughly 2 to 3 million have talents that go
relatively underdeveloped by curriculum that
emphasizes mathematical and verbal reason- 
ing over spatial and visual thinking (2, 4, 6). 
Such individuals could fill the ranks of the 
hands-on occupational needs that Grandin 
highlights throughout her book. 

Despite its strengths, the book misses the 
opportunity to connect to the broader litera- 
ture on spatial reasoning [e.g., (1-4, 6)], and 
almost everything is filtered through the 
particular way that Grandin sees the world. 
Grandin’s writing is rich in description, how- 
ever, and I could almost fully imagine many 
of the things she set out to explain, even if my 
capacity to do so as a primarily verbal thinker 
was limited. 

All of us could benefit from realizing 
that we need different kinds of minds to 
solve society’s biggest problems. Hopefully 
this book will inspire readers to look at the 
world in different ways so that we might 
better recognize the many assets each of us 
brings to the table. 

Black-eyed peas appear in many cuisines in the African diaspora.


PODCAST 


High on the Hog: 

A Culinary Journey from 
Africa to America 
Jessica B. Harris 
Bloomsbury, 2011. 304 pp. 


Understanding how and why crops, 
cuisines, and cooking practices 
spread around the world is a topic of 
interest to anthropologists and agri- 
cultural scientists alike. This week on 
the Science podcast, Jessica Harris 
explores the food practices of the 
African diaspora, offering insight 
into the roles played by the trans- 
atlantic slave trade, a discussion of 
power and reclaiming history, and 
lessons in food sustainability. 
https://bit.ly/3LmnOut 


10.1126/science.ade8448 


30 SEPTEMBER 2022 * VOL 377 ISSUE 6614 1497 


Edited by Jennifer Sills 


Heat waves fuel fires on 
monoculture plantations 


This summer, Europe broke high temperature
records across the continent (1)
and experienced the largest widespread 
drought in 500 years (2). The extreme 
climatic conditions contributed to an 
increased number and extent of forest 
fires across Europe (3). Large forest fires 
are boosted not only by warmer and dryer 
conditions but also by the type and state of 
the vegetation (i.e., fuel models) (4). Many 
recent forest fires worldwide have been 
fueled by large areas of evergreen mono- 
culture plantations, such as Eucalyptus 
spp. in Chile and Portugal in 2017 (5) and 
Pinus spp. in France this past summer 
(6). Given increasing temperatures and 
fire risks, the traditional practices for 
evergreen tree plantations are no longer a 
responsible forest management strategy. 
Evergreen trees have been shown to 
serve as efficient fuel for fires. Specific plant 
traits, such as essential oils or enhanced 
terpene content in Eucalyptus spp. and 
Pinus spp. leaves, respectively, increase the 
trees’ flammability (canopy ignition and the 
subsequent resprout or seed release help the 
species to outcompete others) (7). The higher 
evapotranspiration rates of evergreens com- 
pared with native forests translate to lower 
soil humidity and dryer conditions (8). In
addition, monoculture plantations often 
suffer from poor or nonexistent design and 
forest management, which translates into 
high tree densities, species homogeneity, 
and increased continuity of fuel loads (9). 

Evergreen monoculture plantations have 
increased worldwide over the past two 
centuries (10). In many cases, they are the
selected species for large-scale reforesta- 
tion for erosion control, carbon seques- 
tration, or restoration of burned areas. 
Current environmental policies continue 
to favor these strategic responses, so addi- 
tional monoculture plantations are likely 
[e.g., (11)]. However, given the fire risks 
they pose and the increasing probabilities 
of heatwaves, the lack of innovative design 
and management strategies for these ever- 
green monocultures is creating the perfect 
conditions for larger and more devastat- 
ing forest fires worldwide. 

Scientists, public administrations, and 
the forestry industry must reconsider tree 
planting design, strategies, and manage- 
ment actions when dealing with evergreen 
monoculture plantations. Otherwise, we 
might be just casting more wood on the 
bonfire. 

J. Barquin*, L. Concostrina-Zubiri, I. Pérez-Silos, 
G. Hernandez-Romero, A. Vélez-Martin, 

In these maritime pine (Pinus pinaster) 
plantations in southwest France, older trees 
have withstood fire better than younger 
trees, indicating that tree age and management 
practices could affect flammability. 


A low-carbon future for 
China’s tech industry 


The information and communications 
technology (ICT) industry, which includes 
telephone and computer networks and 
devices, has the potential to reduce emis- 
sions and increase efficiency in other 
industries by using internet platforms to 
connect equipment, production lines, fac- 
tories, suppliers, products, and customers 
(1). The ICT industry is expected to help 
reduce global carbon emissions by 20% by 
2030 (2). However, the industry’s own car- 
bon emissions could undermine its positive 
contributions. Although carbon emissions 
produced by the ICT industry accounted 
for only about 1% of total global carbon 
emissions in 2007, without interventions, 
the number could reach 14% by 2040 
(3). In China, where the ICT industry is 
growing rapidly (4), the government must 
ensure an environmentally responsible 
transition. 

In August, to accelerate the low-carbon 
transformation of the ICT industry, the 
Chinese government issued the Action Plan 
for Green and Low-Carbon Development 
in the Information and Communication 
Industry (2022-2025), which puts forward 
a series of targets (5). For example, the 
plan proposes that China’s 5G operators 
increase the network infrastructure (such 
as fiber optic cables, ducts, and towers) 
that they share from 58.9% in 2021 (6) to 
no less than 80% in 2025. This goal will 
help to prevent redundant construction by 
multiple companies. The plan also provides 
efficiency targets for data centers, which 
currently consume large amounts of energy 
for cooling and lighting. 

To meet the proposed targets, China 
must make strategic decisions about the 
ICT industry’s growth. To improve the 
energy efficiency of ICT infrastructure and 
reduce carbon emissions from duplicate 
construction, the country should continue 
to promote the sharing of an appropriate 
number of large data centers, communica- 
tion base stations for wireless equipment, 
and communication equipment rooms. In 
addition, China should ensure that the ICT 
industry’s supply chain uses low-carbon 
technology. Suppliers should be encour- 
aged to adopt efficient technologies, such 
as intelligent temperature controls, and 
to use Internet of Things technology to 
automate and digitally manage the produc- 
tion and transportation of equipment. All 
steps in the supply chain should use clean 
energy, including biomass (7), as much as 
possible. Finally, the ICT industry should 
take advantage of the possibilities it offers 
to other industries. For example, ICT enter- 
prises should embrace online meetings and 
telehealth visits. 

Addressing climate change requires 
broad global participation and joint action. 
China has become one of the world’s lead- 
ing countries in information technology 
(8). Developing its ICT industry with inten- 
tion, and supporting other developing 
countries facing similar ICT challenges, 
can lay a solid foundation for achieving 
national and global emission reduction 
targets. 

Standards needed for 
antibiotics in water 


Antibiotics are frequently used to treat 
or prevent bacterial infections in humans, 
animals, and crops. Later, the drugs end up 
in the environment, particularly the water 
supply, through wastewater, excrement, or 
groundwater. Because testing for antibiot- 
ics in drinking water (even bottled water) 
is rare, people consume them unknowingly 
(1). Extensive human exposure to low-dose 
antibiotics can lead to decreased response 
to antibiotics when they are needed (2), 
and antimicrobial resistance is growing (3). 
Therefore, governments worldwide should 
work to identify, prevent, and remediate anti- 
biotic contamination of water sources (4). 

Antibiotics in water have been found 
across the world. Commonly used antibiotics 
have been identified in rivers in 75 countries 
across six continents, in concentrations as 
much as 300 times the level deemed safe for 
human consumption (5). In China, thousands 
of tons of antibiotics are released into the 
environment annually (6), and the numbers 
could double by 2030 (7). Antibiotics used by 
the agriculture, aquaculture, and livestock 
industries, as well as those consumed by 
humans for medicinal purposes or in food 
(2), eventually end up in waterways (8). 

To prevent further antibiotic contamina- 
tion, national governments should limit 
antibiotic use in livestock and aquaculture 
(9). In addition, national governments 
should draw up strict, quantitative standards 
for commonly used antibiotics, develop 
low-cost technologies to test for them, and 
conduct routine monitoring of rivers, lakes, 
and drinking water sources to identify 
contamination. By collecting detailed infor- 
mation about which antibiotics are in the 
water and in what quantities, governments 
can pinpoint the water sources that should 
be avoided and install filters to remove 
antibiotics from drinking water. 

Schoolchildren in Puerto Maldonado, Peru, use Foldscopes to inspect plant material they collected on their playground. 


2022 Golden Goose Award honors serendipitous science 


Ceremony recognizes unexpected breakthroughs on pain relievers, microscopes, and laser surgery 


By Andrea Korte 


A powerful pain reliever called ziconotide, which serves as an alter- 
native treatment to opioids, is derived from an unusual source: the 
venom of cone snails. 

The roots of the discovery of ziconotide stretch back to the Phil- 
ippines in the 1970s. However, “we weren't aiming to develop a pain 
compound,” said researcher Michael McIntosh. 

Studying cone snails was initially a detour for researchers 
Baldomero Olivera and Lourdes Cruz. Olivera, who was splitting 
his time between the United States and the Philippines, found his 
research on DNA synthesis hampered by a lack of access to equip- 
ment and supplies in Manila. He found inspiration in his childhood 
interest in shell collecting. Cone snails are abundant in the Philip- 
pines, and their venom, which they use to trap fish, worms, and 
other mollusks, could be a rich avenue for research. 

A number of breakthroughs emerged from the multidecade, 
multinational collaboration begun by Olivera and Cruz, which was 
bolstered by access to raw materials in their Philippines lab and 
leading-edge technology for analysis of the venom at their Univer- 
sity of Utah lab. Among their findings: determining that the venom 
itself was made of peptides—very small, tightly bound proteins. 

Building upon this work were other researchers, including 
McIntosh and Craig Clark, both of whom joined the lab as under- 
graduates. Pursuing his interest in brain science, Clark injected 
different combinations of cone snail venom peptides directly into 
the brains of lab mice, with a surprisingly wide range of results: Some 
combinations caused shaking, others made mice lethargic, others 
made them excitable. McIntosh homed in on the particular peptide 
that caused shaking—purifying it, identifying its chemical structure, 
and dubbing it omega-conotoxin. 

Over the years, the lab, in partnership with collaborators, uncov- 
ered more about omega-conotoxin and its effects—including its use 
as a potent pain reliever for patients with chronic illnesses. Observ- 
ing the effects of conotoxins in animals has also helped researchers 
find new ways to map the body’s nervous system. 

Now, the researchers have been recognized with the 2022 
Golden Goose Award, which honors federally funded research that 
unexpectedly benefits society. 

Funding from the National Institute of General Medical Sciences 
of the National Institutes of Health as well as the Department of 
Defense was indispensable to their discoveries, the researchers 
acknowledged. 

“It wasn’t that easy to analyze small amounts of proteins at the 
time,” said Olivera. “But the fact that we finally got an NIH grant, 
that I would say was what really allowed us to discover things that 
were unexpected.” 

“Tiny Snail, Big Impact: Cone Snail Venom Eases Pain and Injects 
New Energy into Neuroscience” was one of three Golden Goose 
Award-winning research projects recognized at the 14 September 
ceremony hosted by the American Association for the Advance- 
ment of Science. In attendance at the Washington, D.C., ceremony 
were Olivera and McIntosh, who continue to work together at the 
University of Utah, and Cruz, who was named a National Scientist 
of the Philippines in 2006. (Clark passed away in 1994.) 

“This is that wonderful time of year when we shine a light on the 
unforeseen yet completely predictable major impacts of federally 
funded scientific research,” said Sudip S. Parikh, chief executive of- 
ficer at AAAS and executive publisher of the Science family of jour- 
nals, in remarks at the ceremony. “AAAS is proud to be a Golden 
Goose Award founding organization and to have the opportunity to 
share with all of you the stories of these incredible scientists whose 
research went in amazing and valuable directions that will benefit 
all Americans.” 

This year’s award comes weeks after Congress's passing and 
President Biden’s signing of the bipartisan CHIPS and Science Act, 
which reauthorizes the key federal agencies whose projects will 
propel the very innovations honored by the Golden Goose Award. 
The award was launched in 2012, but like much of the research it 
recognizes, its inspiration emerged much earlier. In the 1970s and 
1980s, the late Senator William Proxmire (D-WI) bestowed the du- 
bious honor of the Golden Fleece Award. His awards were intended 
to highlight wasteful government spending, and basic scientific 
research was often in his crosshairs. Representative Jim Cooper 
(D-TN), however, recognized that esoteric, often silly-sounding 
scientific research can yield significant impacts. To recognize and 
celebrate the scientists pursuing federally funded research with of- 
ten unexpectedly beneficial results, Cooper's vision was brought to 
life by a coalition of businesses, universities, and scientific organiza- 
tions, including AAAS. 

Since its inception, the award has honored more than 70 re- 
searchers for breakthroughs deriving from sources as wide-ranging 
and seemingly obscure as horseshoe crab blood and rat massage. 

Also joining the ranks of honorees with the 2022 Golden Goose 
Award are two more teams of researchers. 
Manu Prakash of Stanford University and Jim Cybulski of 
Foldscope Instruments, Inc., earned the award for creating the 
Foldscope—an innovative new microscope made of folded paper. 
Powerful enough to magnify 140 times, inexpensive to manufac- 
ture, and accessible in conditions where a bulky yet delicate and 
expensive microscope might not be practical, the tool was born of 
Prakash's fieldwork in a Thai rainforest studying infectious disease 
diagnostics, where a standard microscope was going unused. 


To develop an accessible microscope that could be toted any- 
where, Prakash's lab received support from NIH, while Cybulski 
received the NIH Fogarty International Center Global Health Equity 
Scholars Fellowship. A National Science Foundation grant al- 
lowed them to manufacture and distribute Foldscopes. 

A multinational team consisting of Tibor Juhasz, Ron Kurtz, Detao 
Du, Gérard Mourou, and Donna Strickland also received the Golden 
Goose Award for paving a path to bladeless LASIK eye surgery. 

The serendipitous results stem from Mourou and Strickland’s 
research on short, intense laser pulses—their development of the 
optical technique known as chirped pulse amplification earned 
them half of the 2018 Nobel Prize in Physics. 

When Du was a graduate student in Mourou’s lab—the Center for 
Ultrafast Optical Science at the University of Michigan—he acciden- 
tally lifted his goggles and exposed his eye to a stray beam. While 
Du had no visible damage to his eye, Kurtz, then a medical resident, 
checked his eyes to find a series of perfectly circular laser burns on 
his retina. The damage was not serious, but the laser left much more 
precise marks than the lasers in medical use at the time. Along with 
Juhasz, they collaborated to pursue clinical uses for these femtosec- 
ond laser pulses. Their research was supported by funding from the 
Department of Energy and NSF. 

Today, this laser approach has supplanted the precision scalpel for 
surgery to improve patients’ vision, from which millions have benefited. 
Cooper—the award’s “Father Goose”—retires this year after 32 
years of service in Congress, but his vision of recognizing and cel- 
ebrating that serendipitous science can change lives persists. 

Said Cooper at the award ceremony, “Tell me a fact, and I may re- 
member it. Tell me the truth, and I may believe. But tell me a story, 
and I will carry it in my heart forever. And that’s what Golden Goose 
is all about.” 


Meredith Asbury, Gwendolyn Bogard, and Haylie Swenson contributed to this report. 

Edited by 
Michael Funk 


SEA LEVEL CHANGE 


Leveling up and down 


Rapid melting of ice sheets and glaciers raises the global average sea level 
but does so in a complex pattern of regional increases and decreases called 
a sea level fingerprint. Coulson et al. found that the fingerprint of melting near 
the Greenland Ice Sheet can be detected using ice mass loss estimates made 
from radar altimetry and model reconstructions of nearby glaciers. Their data 
provide an independent confirmation of the acceleration of Greenland Ice Sheet mass 
loss and reveal the impact of this mass flux on sea surface heights around Greenland. 
—HJS 
Science, abo0926, this issue p. 1550 


New estimates of ice mass loss can provide an independent sea level change fingerprint 
for the Greenland Ice Sheet. 


SUPERCONDUCTIVITY 
Superb graphene 
multilayers 


Graphene bilayers and trilay- 
ers consisting of monolayers 
twisted at just the right angle 
have been shown to be 
superconducting. To acquire a 
unified understanding of super- 
conductivity in these moiré 
superlattices, it is desirable to 
increase the number of known 
graphene moiré superconduc- 
tors. Zhang et al. fabricated 
samples consisting of three, 
four, and five graphene layers 
that were twisted with respect 
to each other in an alternat- 
ing sequence. These graphene 
multilayers were coupled 
to an underlying monolayer 
of tungsten diselenide. The 
researchers observed robust 
superconductivity in all three 
types of samples, with the 
superconducting portion of the 
phase diagram becoming more 
prominent as the number of lay- 
ers increased. —JS 

Science, abn8585, this issue p. 1538 


MEMBRANES 
Filtering hydrocarbons 


Thin-film composite mem- 
branes synthesized through 
interfacial polymerization on 
porous supports are used for 
water filtration on an industrial 
scale. Li et al. incorporated 
hydrophobic vesicles into the 
synthesis process to act as 
carriers to bring pure hydrocar- 
bon or fluorinated hydrocarbon 
molecules into the thin top 
layer of the polymer film. This 
approach boosts the perme- 
ance for hydrophobic solvents, 
enabling the membranes to 
combine high permeance and 
high rejection rates when deal- 
ing with mixtures of organic 
molecules. The authors were 
able to demonstrate the 
separation of realistic crude oil 
mixtures. —MSL 

Science, abq0598, this issue p. 1555 


CYTOSKELETON 
Making a mature 
cell skeleton 


Actin molecules are critical 
components of the cytoskeleton, 
controlling processes such as 
cellular organization, organelle 
trafficking, and force generation. 


Actin proteins contain an 
N-terminal acetylated methio- 
nine that gets removed during 
maturation by a specific and 
previously unknown enzyme. 
Haahr et al. identified it as the 
protease ACTMAP, which modi- 
fies all actin molecules in all cell 
types. Ablation of this factor 
converts the actin cytoskeleton 
to a permanent immature state 
that supports life but comes at 
the expense of muscle weakness. 
—SMH 

Science, abq5082, this issue p. 1533 


POLYMER CHEMISTRY 
Making propylene 
from polyethylene 


Polyolefins are the most abun- 
dantly produced plastics but are 
unfortunately also among the 
hardest to break back down into 
their building blocks. Conk et al. 
developed a preliminary method 
to produce propylene from a 
sequence of reactions between 
waste polyethylene and fresh 
ethylene. Specifically, the pro- 
cess involves a small extent of 
desaturation of each polyethyl- 
ene chain by a dehydrogenation 
catalyst, followed by steady 
breakdown of the chains into 
propylene through catalytic 
isomerization and metathesis 
reactions with ethylene. Subject 
to cost and process optimiza- 
tion, the method shows promise 
for generating a valuable 
feedstock chemical from waste 
plastic. —JSY 

Science, add1088, this issue p. 1561 


Eroding Earth’s 
early crust 


The composition of Earth can 
in part be explained by simply 
looking at meteorites that 
did not accrete into planets. 
Despite the broad agreement 
between the composition of 
certain meteorites and Earth, 
differences in isotopes such 
as neodymium-142 suggest a 
more complicated process for 
planetary formation. Frossard 
et al. looked at some of these 
ancient meteorites to disen- 
tangle the origin of the isotope 
differences (see the Perspective 
by Leinhardt). Collisional ero- 
sion of primordial crusts played 
an outsized role, which may help 
to explain the total amount of 
more volatile elements in the 
Earth. —BG 

Science, abq7351, this issue p. 1529; 
see also add3199, p. 1490 


Oncometabolite 
targets T cells 


Cancer-causing mutations in 
isocitrate dehydrogenase cause 
accumulation of the metabolite 
D-2-hydroxyglutarate (D-2HG). 
Notarangelo et al. showed that 
such high concentrations of 
D-2HG could act as a direct 
inhibitor of lactate dehydroge- 
nase in mouse T cells (see the 
Perspective by Nathan). Inhibition 
of this metabolic enzyme altered 
glucose metabolism in the T cells 
and inhibited their proliferation, 
cytokine production, and ability 
to kill target cells. The authors 
propose that in addition to its 
known cell-autonomous cancer- 
promoting effects, D-2HG may 
also have immunosuppressive 
activity. —LBR 
Science, abj5104, this issue p. 1519; 
see also ade3697, p. 1488 


A new trick for CAR T cells 


Chimeric antigen receptor 
(CAR) T cells are most com- 
monly thought of as a cancer 
therapy, but there is recent 
work testing their use in treating 
nonmalignant diseases. Seif et 
al. developed and tested a CAR 
T cell construct that targeted 
hyphae of Aspergillus fumiga- 
tus, a ubiquitous mold that 
can cause invasive pulmonary 
aspergillosis in immuno- 
compromised individuals. A. 
fumigatus-specific CAR T 
cells had potent and selective 
antifungal activity in vitro and 
in preclinical animal models, 
supporting further clinical 
development. —CSM 

Sci. Transl. Med. 14, eabh1209 (2022). 


Damaged neurons send 
glia immune signal 


In neurodegenerative diseases, 
neurons accumulate DNA 
double-strand breaks (DSBs). In 
a mouse model of neurodegener- 
ation and in brains from humans 
with Alzheimer's disease, Welch 
et al. showed that DSB-bearing 
neurons elevate senescent and 
antiviral immune responses, 
which in turn elicit microglia 
activation. Although microglia 
are known resident immune cells, 
these findings highlight immune 
responses originating from 
damaged neurons as a distinct 
mechanism of disease-associ- 
ated neuroinflammation. —LG 
Sci. Adv. 10.1126/sciadv.abo4662 
(2022). 


MACHINE LEARNING 


Edited by Caroline Ash 
and Jesse Smith 


Learning how water freezes 


Machine learning (ML) is becoming an essential tool 
for modeling physical systems. It all started with a 
cautious attitude because ML comes from a different 
field, but over time, ML has gained more and more trust 
from researchers who bring their knowledge of physics 
to its development. Piaggi et al. present a prime example of 
the successful application of ML to a fundamental physical 
phenomenon, ice nucleation, the initial step in freezing water 
that occurs on time scales longer than traditional first-princi- 
ples simulations can afford. The deep neural network that the 
authors developed to model quantum-mechanical inter- 
atomic forces substantially accelerated molecular dynamics 
simulations of this rare event, propagating it for previously 
unthinkable time spans and thus revealing important atomic- 
level details of what happens when water freezes. —YS 
Proc. Natl. Acad. Sci. U.S.A. 119, e2207294119 (2022). 


Machine learning can be used to reveal atomic-level 
details of ice nucleation, the initial step in freezing water 
(such as in the snow crystal shown). 


Microlensing by an 
isolated black hole 


Black holes emit no light of 
their own, so stellar mass black 
holes have only been identified 
in binary systems. Sahu et al. 
searched for an isolated black 
hole using the astrometric 
microlensing technique, which 
occurs when a foreground black 
hole (the lens) passes precisely 
in front of a background star 
(the source) as seen from Earth. 
Gravitational lensing causes 
the apparent brightness of 
the source to increase for a 
few months and its apparent 
position to shift by a few milliarc- 
seconds over several years. By 
monitoring the astrometry of a 
microlensing event, the authors 
were able to determine that it 
was due to an isolated black hole 
of seven solar masses. —KTS 
Astrophys. J. 10.3847/1538-4357/ac739e (2022). 


MICROBIOME 


Impersonating insulin 
Type 1 diabetes (T1D) is a com- 
plex autoimmune disease that 
leads to the destruction of pan- 
creatic beta cells. Production of 
the glucose-regulating hormone 
insulin becomes compromised, 
and T1D usually presents early 
in life. The causes of T1D are 
not clear. However, Girdhar 
et al. have identified a link 
between a human gut microbe 
and T1D onset. The research- 
ers found that the bacterium 
Parabacteroides distasonis 
produced an insulin-like peptide 
with structural similarity to 
human insulin. This molecular 
mimicry tricked the immune 
system into making insulin 
antibodies that then attacked 
the pancreas and enhanced T1D 
onset in an animal model. —PNK 
Proc. Natl. Acad. Sci. U.S.A. 119, 
e2120028119 (2022). 

ANTIBIOTICS 
Antibiotics shape 
lifetime health 


Premature infants are vul- 
nerable to infection and 
consequently are frequently 
given antibiotics prophylac- 
tically. However, antibiotic 
treatment in the very young 
can interfere with the estab- 
lishment of a complete gut 
microbiota community. Poon et 
al. investigated the longer-term 
consequences of vancomycin 
treatment on neonatal mice. In 
addition to sustained disruption 
to the microbiota, the authors 
also noted changes in gut 
transit times, and found that 
the activity of myenteric and 
submucosal neurons differed 
between treated and untreated 
mice as they reached adult- 
hood. Furthermore, neonatal 
vancomycin treatment was 
associated with lifetime deple- 
tion of mucosal serotonin levels. 
Although male mice showed 
greater myenteric neural 
disruption, serotonin deple- 
tion was seen in both sexes. 
Such profound physiological 
changes in response to early- 
life antibiotic treatment may 
have sex-specific and lifetime 
consequences for mammalian 
health. —CA 
J. Physiol. 10.1113/JP282939 (2022). 


ANIMAL COMMUNICATION 
Rock on hyrax! 


Anyone who has been cap- 
tivated by a tune—or more 
precisely by a singer belting out 
a good song—knows the power 
of a rhythmic performance. 
Many animals sing in courtship, 
notably birds, but also whales, 
rodents, primates, and hyraxes. 
Demartsev et al. studied hyrax 
song and its rhythms for more 
than a decade and found that 
males that sing more often and 
have more rhythm tend to have 
higher reproductive success. 
That hyraxes sing alone and 
with rhythm suggests that their 
songs are shaped by female 
choice and reflect male qual- 
ity rather than being a group 
coordination. Therefore, sexual 
selection may play a role in the 
evolution of song and rhythm 
more widely. —SNV 
J. Anim. Ecol. 10.1111/1365-2656.13801 (2022). 


GENETICS 
Genetics of chimpanzee 
conservation 


Genetics can be applied to conservation 
efforts to better understand the popula- 
tion dynamics of endangered species. 
However, sample collection from wild 
species can be invasive and difficult 
to achieve. Fontsere et al. used noninvasive, 
low-coverage genomic sequence capture of 
fecal samples from 828 wild chimpanzees 
across 48 sites in Africa to illuminate their 
population history. They found that chimpan- 
zee groups have been historically isolated, 
but there is evidence of limited admixture 
between chimpanzee subspecies and their 
close relatives, bonobos. Although it is not a 
complete sampling of chimpanzee popula- 
tions, this catalog of genetic variation can 
also aid in efforts to return poached chimpan- 
zees to their countries of origin, in agreement 
with international standards. —CNS 
Cell Genom. 10.1016/j.xgen.2022.100133 (2022). 


Chimpanzee diversity can be monitored 
by noninvasive fecal sampling. 


ORGANIC CHEMISTRY 
Palladium protection 
during amination 


Palladium catalysis is widely 
used in pharmaceutical and 
agrochemical synthesis to forge 
bonds between carbon and 
nitrogen centers. However, in 
the case of amination adjacent 
to olefins, the amine reactants 
often disrupt the catalysis by 
coordinating more strongly 
to the metal than the olefins. 
Jin et al. report that the use of 
bidentate phosphine ligands 
can protect palladium from 
deactivation by amines dur- 
ing oxidative allylic amination. 
Simulations suggested that the 
benzoquinone oxidant further 
protects the metal by weakening 
the amine's coordination affinity 
through hydrogen bonding. —JSY 
Nat. Chem. 10.1038/s41557-022-01023-x (2022). 

HUMAN DISEASE 
Vanquishing 
vexing vectors 


Many major human and animal 
diseases are transmitted by 
blood-feeding insects and 
ticks. Because pathogens are 
ingested when these vectors 
feed, interactions with the vec- 
tor’s gut microbiome, midgut 
secretions, and epithelial cells 
are key factors in the transmis- 
sion of disease. Barillas-Mury 
et al. reviewed recent research 
bolstering our understand- 
ing of how pathogens interact 
with both vectors and their 
microbiomes, as well as with 
vector-derived factors and 
vertebrate hosts after vector 
bites. The authors also discuss 
how these advances are being 
translated into next-generation 
strategies to control natural vec- 
tor populations and inform the 
development of biomarkers of 
vector exposure. —STS 
Science, abc2757, this issue p. 1507 


AGING 
Long live the mice 
(and humans) 


The question of what determines 
our longevity has preoccupied 
humans throughout the ages, 
and different researchers have 
tried to address this in a variety 
of ways. Bou Sleiman et al. 
approached this question on 
a large scale, examining the 
growth patterns and genetics of 
thousands of mice from a mul- 
ticenter study of aging (see the 
Perspective by Magalhdes). From 
this analysis, the authors identi- 
fied the contributions of various 
genes to the animals’ longevity, 
which differed between male and 
female mice. The genes involved 
were well conserved between 
species, as evidenced by the find- 
ing of similar effects in humans 
and worms. In addition to genet- 
ics, early-life nutrition appeared 
to play a major role. —YN 

Science, abo3191, this issue p. 1508; 
see also ade3119, p. 1409 

IMMUNE SIGNALING 
Production of small 
signaling isomers 


Isomers of the small signal- 
ing molecule cyclic ADP ribose 
(cADPR) are produced when 
bacteria and plants respond to 
pathogens. Manik et al. found 
that cyclization is managed 
by Toll/interleukin-1 receptor 
(TIR) domain proteins that 
are themselves activated by 
conformational changes. One 
cyclic isomer, 3'cADPR, activates 
bacterial antiphage defense sys- 
tems in bacteria and suppresses 
immunity in plants. —PJH 
Science, adc8969, this issue p. 1501 


DRUG DESIGN 
A path to pain relief 


The serious problems associ- 
ated with opioid addiction 
have motivated the search for 
non-opioid pain-relief drugs. 
The α2A-adrenergic receptor 
(α2AAR) is a validated pain 
receptor that is targeted by the 
drug dexmedetomidine. This drug 
is used in hospitals but is unsuit- 
able for broader use because 
it causes sedation and is not 
orally bioavailable. Fink et al. 
screened more than 300 million 
virtual molecules and identi- 
fied agonists that bind α2AAR 
with reasonable affinity and are 
structurally unrelated to known 
agonists. Crystal structures of 
two of the compounds bound to 
α2AAR allowed optimization to 
improve potency. The optimized 
compounds were effective in a 
neuropathic pain model without 
causing sedation, making them 
promising leads for further 
development. —VV 
Science, abn7065, this issue p. 1509 


NEUROEVOLUTION 
Primate brain diversity 


Primate brains benefit from an 
evolutionary specialization of the 
cortex that supports cognition. 
Ma et al. tracked transcriptomes 
from cellular nuclei sampled 
from human, chimpanzee, 


macaque, and marmoset brain 
cortices. The transcriptomes 
define groups of cell types, some 
of which are conserved and 
some that differ between these 
species. Several human-specific 
specializations were identified. 
Expression patterns of more 
than 900 genes associated with 
brain disorders revealed a variety 
of conserved, divergent, and group- 
specific patterns. —PJH 

Science, abo7257, this issue p. 1511 


MARTIAN GEOLOGY 
Igneous rocks 
in Jezero crater 


The Perseverance rover landed 
in Jezero crater on Mars in 
February 2021. Farley et al. 
describe the geologic units 
investigated by the rover as it 
began to traverse the crater 
floor, based on images and spec- 
troscopy. The authors found that 
the rocks are of igneous origin, 
later modified by reactions with 
liquid water. They also describe 
the collection of drilled samples 
for potential return to Earth by 
another spacecraft. Liu et al. 
present compositional data for 
these igneous rocks based on 
x-ray fluorescence measure- 
ments. They found similarities 
with some Martian meteorites 
and conclude that the igneous 
rocks formed from crystals that 
sank in a thick sheet of magma. 
Together, these studies constrain 
the history of Jezero crater and 
provide geological context for 
analysis of the drill samples. 
—KTS 

Science, abo2196, abo2756, this issue p. 1512, p. 1513 


OCEAN ACIDIFICATION 
Acceleration in the Arctic 


The Arctic is warming at a rate 
faster than any comparable 
region on Earth, with a conse- 
quently rapid loss of sea ice 
there. Qi et al. found that this sea 
ice loss is causing more uptake 
of atmospheric carbon dioxide 
by surface water and driving 
rapid acidification of the western 
Arctic Ocean at a rate three to 
four times higher than that of the 
other ocean basins. They attri- 
bute this finding to melt-driven 
addition of freshwater and the 
resulting changes in seawater 
chemistry. —HJS 

Science, abo0383, this issue p. 1544 


METHANE EMISSIONS 
Fueling global warming 


Flaring, the process of burn- 
ing natural gas escaping from 
oil and gas wells, is primar- 
ily intended to combust the 
powerful greenhouse gas 
methane to minimize its emis- 
sion. But is flaring as effective 
as is claimed? Plant et al. used 
airborne sampling to measure 
flare efficiency in three major 
gas production regions in the 
United States and found that 
methane emissions were five 
times higher than currently 
thought (see the Perspective by 
Duren and Gordon). Therefore, 
flaring is often not as efficient 
as presumed—or methane 
plumes simply are not com- 
busted at all. —HJS 

Science, abq0385, this issue p. 1566; 

see also ade2315, p. 1486 


BIOCHEMISTRY 
Protecting PKC-γ for 
neurodegeneration 


Mutations in protein kinase 
Cγ (PKCγ) cause a hereditary 
neurodegenerative disease 
called spinocerebellar ataxia 
type 14. Pilo et al. found that 
these mutations increase basal 
activity by disrupting autoinhibi- 
tory contacts within the kinase. 
This structural change also 
impairs quality control pathways 
that would normally lead to the 
degradation of aberrant PKC. 
Transgenic mice expressing 
a disease-associated mutant 
exhibited ataxia-associated loss 
of Purkinje cells and increased 
phosphorylation of PKCγ target 
proteins in the cerebellum. —LKF 
Sci. Signal. 15, eabk1147 (2022). 


IMMUNODEFICIENCY 
Subpar sustenance 
for lymphocytes 


WHIM syndrome, a human 
inborn error of immunity 
featuring reduced numbers 
of circulating lymphocytes, 
is caused by gain-of-function 
mutations in the chemokine 
receptor CXCR4. Zehentmeier 
et al. investigated stromal cell 
abnormalities contributing to 
lymphopenia by analyzing gene 
expression by bone marrow 
mesenchymal stem cells (MSCs) 
in a mouse model of a WHIM- 
inducing mutation. These cells 
exhibited reduced expression of 
the gene for interleukin-7 (IL-7), 
a cytokine critical for early B cell 
differentiation, because of excess 
signaling through the lympho- 
toxin beta receptor (LTβR). 
Reduced expression of the genes 
for IL-7 and B cell-activating fac- 
tor by lymph node stromal cells 
contributed to the lymphopenia. 
These findings demonstrate that 
CXCR4 and LTβR are viable tar- 
gets for therapies to ameliorate 
the immunodeficiency of WHIM 
syndrome. —IRW 


Sci. Immunol. 7, eabo3170 (2022). 


HUMAN DISEASE 


Understanding pathogen survival and transmission 
by arthropod vectors to prevent human disease 


Carolina Barillas-Mury, José M. C. Ribeiro, Jesus G. Valenzuela 


BACKGROUND: Many endemic human and ani- 
mal diseases caused by viruses, bacteria, protozoa, 
or nematodes are transmitted by blood-feeding 
insects or ticks. Because pathogens are ingested 
with the blood meal, their interactions with the 
vector’s gut microbiome, midgut secretions, and 
epithelial cells are key determinants of disease 
transmission. Most pathogens must infect the 
vector and multiply to be transmitted, and this 
amplification greatly increases their chances 
of infecting a new vertebrate host. Accordingly, 
how pathogens survive or invade the midgut, 
and their interactions with the invertebrate im- 
mune system, are areas of great interest in vec- 
tor biology. Blood-feeding arthropods secrete 
saliva while probing for blood, and saliva from 
many different vectors has been shown to have 
antihemostatic, anti-inflammatory, and im- 
munomodulatory activities that allow success- 
ful blood feeding. Other vector-derived factors, 
including exosomes, egested microbiota, and 
pathogen-derived molecules, are injected at 
the bite site where pathogens are delivered 
and modulate the local environment and the 
innate immune responses of the host skin, 
promoting pathogen establishment and dis- 
ease progression. Thus, functional studies of 
vector saliva and other vector-derived factors, 
their roles in pathogen establishment and trans- 
mission, the potential use of specific protein 
targets to prevent vector-borne diseases, and 
the use of vector salivary proteins as biomark- 
ers of exposure to vector bites are also of great 
importance. 

Interaction of pathogens with arthropod vector molecules promotes pathogen survival and transmission. 
Vector-borne pathogens interact with the gut microbiota and with secreted molecules and midgut receptors 
necessary for survival or for cell invasion. This allows them to circumvent the vector immune system to 
avoid elimination, multiply, and be transmitted. Pathogens also interact with vector-secreted molecules that 
are coinjected into the skin of the vertebrate host during disease transmission by the bite of an infected 
vector. These vector-derived factors—which include salivary proteins, parasite exosomes, parasite molecules, 
and egested microbiota—modify the hemostatic and innate immune response of the host skin, resulting 

in enhanced pathogen transmission. 

ADVANCES: Over the past 20 years, the study of 
vector biology has been revolutionized by many 
new molecular, cell biology, and genomic tools. 
For example, the advent of genomics and third- 
generation sequencing has allowed for high- 
density de novo transcriptome assembly and 
single-cell RNA transcriptome analysis, even 
without a reference genome. Perturbations in 
vector gene expression in response to infection 
with specific pathogens can now be studied at a 
genome-wide level. The expression of recombi- 
nant proteins derived from genomics or tran- 
scriptomic studies has enabled structure-function 
studies, the discovery of salivary biomarkers to 
measure vector exposure in humans, and the 
generation of specific antibodies for subcellular 
localization of pathogen receptors or of vector- 
derived proteins that interact with specific path- 
ogens. Transient gene silencing has revealed 
the participation of specific genes or signal- 
ing pathways in both vector physiology and in 
vector immunity against pathogens. There have 
also been great advances in the genetic ma- 
nipulation of insects with the CRISPR-Cas9 
system and the development of gene drive 
strategies, such as CRISPR-Cas9-directed ho- 
mologous repair. The symbiotic proteobacterium 
Wolbachia has been introduced into natural 
populations of Aedes aegypti mosquitoes to 
reduce viral replication and transmission of 
dengue, Zika, and Chikungunya with promis- 
ing results. Such approaches may hold the long- 
awaited promise of permanently modifying 
natural vector populations to render them un- 
able to transmit disease. 


OUTLOOK: In this Review, we highlight recent 
progress in our understanding of the basic 
molecular and cell biology of pathogen inter- 
actions with vectors and their microbiomes, as 
well as the interactions between vector-derived 
factors and pathogens with their vertebrate 
hosts during the establishment of infection after 
a vector bite. We discuss how these advances in 
the biology of disease transmission have also 
translated into next-generation strategies to 
control natural vector populations and to mod- 
ify them to disrupt their capacity to transmit 
disease. Moreover, we review the development 
of salivary biomarkers of vector exposure as a 
tool for evaluating both epidemiological studies 
and vector control strategies by measuring vec- 
tor biting activity in humans. 


HUMAN DISEASE 


Understanding pathogen survival and transmission 
by arthropod vectors to prevent human disease 


Carolina Barillas-Mury, José M. C. Ribeiro, Jesus G. Valenzuela 


Many endemic poverty-associated diseases, such as malaria and leishmaniasis, are transmitted by 
arthropod vectors. Pathogens must interact with specific molecules in the vector gut, the microbiota, 
and the vector immune system to survive and be transmitted. The vertebrate host, in turn, is infected 
when the pathogen and vector-derived factors, such as salivary proteins, are delivered into the skin 
by a vector bite. Here, we review recent progress in our understanding of the biology of pathogen 
transmission from the human to the vector and back, from the vector to the host. We also highlight 
recent advances in the biology of vector-borne disease transmission, which have translated into 
additional strategies to prevent human disease by either reducing vector populations or by disrupting 


their ability to transmit pathogens. 


Blood-feeding insects are vectors of some 
of the most devastating diseases in low- 
income areas, such as malaria, Zika, den- 
gue, and leishmaniasis. Vector-pathogen 
interactions, particularly those mediated 
by the midgut, the gut microbiota, and the 
vector immune system, are intense areas of 
study because they are major determinants 
of disease transmission by limiting patho- 
gen survival. Molecular interactions between 
vector-derived factors—such as salivary pro- 
teins, exosomes, and egested microbiota—and 
the mammalian host modulate local skin im- 
munity and are critical for pathogen establish- 
ment. Thus, it is very important to understand 
how saliva and other vector- or pathogen- 
derived factors affect disease transmission 
and to understand their potential as vaccine 
targets to prevent diseases or as biomarkers 
of vector exposure. 


Interaction of pathogens with vector 
midgut proteins 


Ingested pathogens need to survive the harsh 
environment of the midgut lumen, where 
blood digestion takes place. The blood meal is 
surrounded by a chitinous peritrophic matrix 
(PM), which prevents direct contact between 
the gut microbiome and epithelial cells (Fig. 1). 
In Anopheles gambiae mosquitoes, the perme- 
ability of the ectoperitrophic space (between the 
PM and the midgut epithelium) is tightly mod- 
ulated by secretion of an immune-modulatory 
peroxidase (IMPer) (1). This enzyme cross-links 
proteins in the ectoperitrophic space, allowing 
commensal bacteria to proliferate in the gut 
lumen without eliciting epithelial immunity. 
This also allows early stages of Plasmodium to 
develop within the gut without detection. In 
tsetse flies, Trypanosoma brucei takes advan- 
tage of the immature and fluid PM to move 
into the ectoperitrophic space. There, it colo- 
nizes the proventriculus to later establish mid- 
gut infection (2). 

Some pathogens must bind to specific midgut 
receptors to survive. Leishmania major para- 
sites, for example, are covered with lipophos- 
phoglycans that specifically bind to PpGal, a 
sand fly midgut galectin receptor required 
for parasite survival and development (Fig. 1) 
(3). The Borrelia surface protein OSP A binds 
to TROSPA, a receptor in the gut of the Ixodes 
scapularis tick (Fig. 1), an interaction required 
for tick colonization by spirochetes (4). Thus, 
arthropod midgut receptors are attractive poten- 
tial vaccine targets to block disease transmission. 


Effect of gut microbiota and multiple blood 
meals on vectorial capacity 


Commensal gut bacteria proliferate extensively 
as the blood meal undergoes digestion and af- 
fect pathogen establishment. In A. gambiae, the 
mosquito gut microbiota limits Plasmodium 
infection, and oral antibiotics increase Plasmo- 
dium survival (5). By contrast, the gut micro- 
biota of sand flies is essential for Leishmania, 
and antibiotics inhibit their development into 
infective metacyclics (6). The close interaction 
of pathogens with the insect gut microbiota has 
been exploited by using an enterobacterium 
isolated from wild-caught Anopheles to limit 
Plasmodium development in mosquitoes (7) 
and Chromobacterium sp. from field-caught 
Aedes to impair Plasmodium and dengue virus 
infection (8). 

Recent findings show that ingestion of multi- 
ple blood meals can enhance pathogen trans- 
mission by infected vectors (Fig. 1). When 
Leishmania-infected sand flies take a second 
blood meal, parasites dedifferentiate into a 
multiplicative stage called retroleptomonad, 
which expands the population of infective 
metacyclics in the vector gut (9). Similarly, 
dengue virus-infected mosquitoes fed a sec- 
ond uninfected blood meal exhibit higher 
systemic virus dissemination that enhances 
viral transmission (10). This increase in vec- 
torial capacity after the ingestion of blood 
from a healthy host may explain why disease 
transmission is so effective in the field. 


Insect innate immunity and vector competence 


Antiplasmodial responses of anopheline mos- 
quitoes can limit malaria transmission. In 
A. gambiae, these responses involve the se- 
quential and coordinated activation of epithe- 
lial immunity by invaded midgut cells (11-15), 
microvesicle release by circulating hemocytes 
(16, 17), and activation of the mosquito com- 
plement system (15, 18). Genetic studies have 
revealed that Pfs47, a protein on the surface 
of the parasite Plasmodium falciparum, me- 
diates immune evasion (19, 20) by interacting 
with a receptor in the mosquito midgut (21). 
The receptors have diverged in evolutionarily 
distant mosquitoes (21, 22) and constantly se- 
lect for parasites with a Pfs47 haplotype com- 
patible with their receptor (22). 

Antiviral immunity in Drosophila and mos- 
quitoes is mediated by the RNA interference 
(RNAi) pathway, whereas the Piwi-interacting 
RNA (piRNA) pathway appears to be active 
only in mosquitoes (23-25). Silencing key com- 
ponents of the RNAi pathway in Aedes aegypti 
mosquitoes increases Dengue (26) and Sindbis 
virus replication (27) and, in A. gambiae, en- 
hances the infection and dissemination of 
O’nyong-nyong virus (28). 

piRNAs range in size between 26 and 31 base 
pairs and have been mostly linked to the si- 
lencing of retrotransposons (29) and other ge- 
netic elements in the germ line (30). However, 
an antiviral role in mosquito cell lines has 
been proposed, where viral DNAs (vDNAs) are 
integrated into the A. aegypti genome as en- 
dogenous viral elements (EVEs) and serve as 
templates to produce piRNAs (31-33). EVEs may 
therefore represent a type of immune memory 
by providing a record of past infections (32). 
EVE-derived piRNAs are specifically loaded 
onto Piwi4 and inhibit virus replication (33). 
Inhibition of vDNA in Aedes albopictus and A. 
aegypti mosquitoes infected with two arbovi- 
ruses (Dengue or Chikungunya) reduces viral 
small RNAs and results in high susceptibility 
to viral infection and loss of tolerance. Thus, 
vDNA is an important mechanism for per- 
sistent viral infections in mosquitoes, which 
makes these insects highly efficient disease 
vectors (34). 

Furthermore, overactivation of the JAK-STAT 
pathway decreases A. aegypti susceptibility to 
dengue virus infection, whereas silencing key 
components for activation of the pathway en- 
hances susceptibility to infection (35). Activa- 
tion of the Toll pathway and the gut microbiota 
reduces A. aegypti susceptibility to dengue in- 
fection (36), whereas constitutive activation 
of the immunodeficiency (IMD) pathway de- 
creases microbiota bacterial loads and increases 
Sindbis virus infection (37). By contrast, elimi- 
nation of the gut microbiota in A. gambiae 
decreases infectivity with O’nyong-nyong virus 
(38). The A. gambiae midgut activates local anti- 
viral responses to O’nyong-nyong virus infection, 
which involve the IMD and JAK-STAT signaling 
pathways, whereas the RNAi pathway mediates 
systemic responses (38). Loquacious 2 (Loqs2) is 
required to control systemic replication of den- 
gue virus and Zika virus in A. aegypti. Loqs2 is 
not expressed in the midgut, but ectopic midgut 
expression in transgenic mosquitoes limits den- 
gue virus replication and dissemination (39). 

Fig. 1. Pathogens ingested by arthropod vectors interact closely with the gut microbiota and with midgut epithelial cells. They traverse the PM, and some 
interact with specific receptors, such as P. papatasi galectin (PpGal), I. scapularis TROSPA, and the P. falciparum Pfs47 receptor. Epithelial cells activate antiviral responses 
mediated by the piRNA, small interfering RNA (siRNA), and JAK-STAT pathways. At the same time, caspase-mediated nitration responses of ookinete-invaded midgut 
cells activate hemocyte microvesicle release, which promotes the activation of complement-mediated Plasmodium elimination. Ingestion of a second blood meal can 
also provide nutritional and/or differentiation signals that enhance pathogen transmission. LPGs, lipophosphoglycans. 


Innate immune priming 

Insect immunity relies on innate immune re- 
sponses that were thought to be hardwired. 
However, studies in A. gambiae have revealed 
that mosquitoes previously exposed to Plas- 
modium infection mount a more effective re- 
sponse to subsequent infections (7). The priming 
response is established when ookinete inva- 
sion allows direct contact of the gut micro- 
biota with midgut cells, activating prostaglandin 
synthesis mediated by two heme peroxidases 
(HPx7 and HPx8) (16). This triggers the systemic 
release of a hemocyte differentiation factor 
(HDF) that increases the proportion of circu- 
lating granulocytes (7). HDF is a complex of 
lipoxin A4 bound to Evokin, a carrier from 
the lipocalin family. Immune priming involves 
increased ability of mosquitoes to synthesize 
lipoxins, predominantly lipoxin A4 (40). Primed 
hemocytes then release more microvesicles in 
response to ookinete invasion, which enhances 
the activation of the mosquito complement 
system (17). 


Vector saliva 


The saliva of blood-feeding arthropods has anti- 
platelet, anticlotting, and vasodilatory proper- 
ties and contains modulators of inflammation 
and immunity (41, 42) (Fig. 2). Despite similar 
functions, these proteins are highly divergent 
among insect species because blood feeding 
evolved independently, and these genes evolve 
rapidly. For example, most vectors that feed on 
mammalian blood have high levels of adenosine 
5'-diphosphate (ADP)-hydrolyzing enzymes to 
inhibit platelet aggregation, but these levels 
are achieved using different strategies. A 5'- 
nucleotidase family exists in ticks (43), triatomine 
bugs (44), and mosquitoes (45); a CD39 family 
is present in fleas (46); and a Cimex-type apyrase 
is found in sand flies (47) and bed bugs (48). 
The most potent vasodilator known, the pep- 
tide maxadilan, was discovered in the sand fly 
Lutzomyia longipalpis (49). Several “kratagonists” 
(from the Greek “kratos” meaning “to seize”) in 
vector saliva, such as lipocalins and odorant- 
binding proteins, act as scavengers of histamine, 
serotonin, leukotrienes, thromboxane A2, and 
inflammatory cytokines (50), whereas other 
protein families inhibit the activation of the 
complement system (51-54). 

Fig. 2. Vector-derived factor interactions with host skin. In addition to pathogens, blood-feeding 
arthropods deliver a series of vector-derived factors, such as salivary components and gut 
microbiota, and pathogen components, such as exosomes and proteophosphoglycans, into the 
host skin. Vector-derived factors counteract the hemostatic system and modulate host skin immunity, 
including activation of the inflammasome, which favors the establishment of some parasitic 
and viral infections. 

The advent of third-generation sequencing 
has revealed the existence of many previously 
unknown salivary proteins—~50 different sali- 
vary polypeptides in sand flies, >100 in mos- 
quitoes and triatomine bugs, and thousands 
in hard ticks (55, 56). The function of most of 
these proteins remains unknown. The increased 
number of salivary transcripts in ticks is the 
result of a large expansion of several protein 
families. However, only a few members of each 
family are expressed at a given time, and the 
composition of saliva changes over time (57). 
This phenomenon, which we refer to as sialome 
switching, may be an immune evasion strategy, 
but how it is regulated remains a mystery. 


The effect of vector saliva and vector-derived 
factors on pathogen establishment 


Needle inoculation of pathogens does not mimic 
their natural delivery by an insect bite. For ex- 
ample, West Nile virus delivered to mice by 
mosquito Culex tarsalis bites results in higher 
viremia and neuroinvasion compared with 
needle inoculation (58). Likewise, injection of 
Semliki Forest virus in a skin site previously 
bitten by Aedes mosquitoes results in higher 
viremia and increased mortality in rodents 
(59), mediated by interleukin-1β (IL-1β) activa- 
tion and recruitment of neutrophils to the 
dermis (59). Similarly, Leishmania donovani 
transmission by sand fly bites induces the 
activation of the inflammasome, production 
of IL-1β, recruitment of neutrophils, and par- 
asite visceralization (Fig. 2). By contrast, no 
parasite visceralization occurs with needle- 
injected parasites (60). Activation of IL-1β and 
neutrophil recruitment were triggered by the 
gut microbiota egested during the sand fly bite 
(60). Plasmodium sporozoites injected at the 
bite site by Anopheles stephensi result in higher 
parasitemias, enhanced progression to cerebral 
malaria, and death (61). SALP15, a tick salivary 
protein, is essential for establishment and dis- 
semination of infection by inhibiting T cell 
activation and protecting Borrelia from the 
host complement (62). Thus, vector-derived 
factors from different disease vectors appear 
to activate innate immune responses and to 
recruit cells, such as neutrophils, that facil- 
itate pathogen establishment (Fig. 2). 

There are other mechanisms by which vector 
saliva facilitates pathogen transmission. For 
example, the sand fly salivary endonuclease 
Lundep destroys neutrophil extracellular traps 
(NETs), increasing survival of L. major par- 
asites (63). The sand fly salivary hyaluronidase 
enhances Leishmania infection in mice by 
working as a so-called spreading factor and by 
increasing neutrophil recruitment to the bite 
site (64, 65). The A. aegypti salivary protein NeSt1 
increases Zika virus pathogenesis by activating 
neutrophils (66), whereas the salivary protein 
AaVA-1 enhances Zika and Dengue transmis- 
sion in rodents by activating autophagy in im- 
mune cells (67). In addition to salivary proteins, 
Leishmania proteophosphoglycans and exo- 
somes as well as egested microbiota are part of 
the infectious inoculum of sand flies (60, 68, 69). 


Arthropod salivary proteins as vaccine targets 
to prevent disease transmission 


Arthropod salivary proteins are promising vac- 
cine candidates to disrupt transmission of vector- 
borne diseases (70, 71). Vaccination of rhesus 
macaques with the sand fly salivary protein 
PdSP15 confers protection from L. major trans- 
mitted by sand fly bites, which correlates with 
the development of a PdSP15-specific type 1 T 
helper (TH1) cell immunity and anti-Leishmania 
immunity (72). TH1 immunity to the salivary 
protein coinjected at the bite site is thought to 
protect from disease by altering the immune 
response to parasite antigens (72). Recently, 
immunization of guinea pigs with an mRNA 
vaccine encoding selected tick salivary pro- 
teins induced a skin response that prevented 
Borrelia transmission by ticks in vaccinated 
animals (73). In mosquitoes, vaccination of mice 
with Anopheles salivary protein AgTRIO pro- 
tects animals from Plasmodium infection (74), 
whereas immunization with the A. aegypti sal- 
ivary protein AgBR1 protects mice from Zika (75) 
and West Nile virus infections (76). Antibodies 
to AgBR1 prevent AgBR1-induced neutrophil ac- 
tivation and block saliva-mediated enhance- 
ment of virus establishment (75). Recently, a 
mosquito salivary peptide-based vaccine was 
shown to be safe and to produce specific im- 
munity in a phase 1 clinical trial in humans (77). 


Arthropod salivary proteins as biomarkers 
of vector exposure 


Humans mount antibody responses to certain 
salivary proteins that can be used as biomark- 
ers of exposure to vector bites (78). Several re- 
combinant vector salivary proteins and synthetic 
peptides have been developed as biomarkers, 
including the gSG6-P1 salivary peptide from 
A. gambiae (79), Nterm-34kDa salivary peptide 
from A. aegypti (80), rLJM17 and rLJM11 from 
L. longipalpis (81), rSP03B from Phlebotomus 
perniciosus (82), PpSP32 from Phlebotomus 
papatasi (83), Tsgf118-43 from tsetse flies (84), 
rTisP14.6 from Triatoma infestans (85), and 
Calreticulin from I. scapularis (86). These are 
useful tools to assess risk of disease transmis- 
sion and the effectiveness of vector control 
strategies (87) in ongoing disease elimination 
campaigns. 


Transgenesis, paratransgenesis, and other 
vector control strategies 


Genetic modifications can be used to reduce 
natural vector populations or to replace them 
with transgenic vectors that no longer trans- 
mit disease (Fig. 3). Two major obstacles to 
this approach have been overcome: first, the 
identification of gene targets to be disrupted 
or previously undiscovered genes that would 
interrupt pathogen development, and second, 
strategies to introduce a new gene or to mod- 
ify a vector gene. However, once released, trans- 
genic insects will mate with wild-type ones, and 
the genetic modifications of interest will dilute 
in every generation (88) unless they can spread 
more effectively through non-Mendelian in- 
heritance that can drive the gene through the 
population. 
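
The contrast between Mendelian dilution and drive can be illustrated with a deliberately simple, deterministic model. The sketch below is not taken from the studies cited here. It assumes random mating, no fitness cost of the transgene, and a hypothetical germ-line homing efficiency `e` (the fraction of wild-type alleles converted in heterozygotes), so that heterozygotes pass on the transgene with probability (1 + e)/2. All function and parameter names are illustrative.

```python
def next_frequency(q: float, homing_efficiency: float) -> float:
    """Transgene allele frequency in the next generation.

    Assumes random mating and no fitness cost (illustrative assumptions).
    Heterozygotes transmit the transgene with probability (1 + e) / 2,
    where e is the hypothetical germ-line homing efficiency.
    """
    e = homing_efficiency
    homozygotes = q * q                # always transmit the transgene
    heterozygotes = 2 * q * (1 - q)    # transmit it at rate (1 + e) / 2
    return homozygotes + heterozygotes * (1 + e) / 2


def simulate(q0: float, homing_efficiency: float, generations: int) -> list[float]:
    """Return the allele frequency trajectory, starting from q0."""
    trajectory = [q0]
    for _ in range(generations):
        trajectory.append(next_frequency(trajectory[-1], homing_efficiency))
    return trajectory


if __name__ == "__main__":
    # Hypothetical release: transgene at 5% allele frequency.
    mendelian = simulate(q0=0.05, homing_efficiency=0.0, generations=10)
    drive = simulate(q0=0.05, homing_efficiency=0.9, generations=10)
    print(f"Mendelian inheritance after 10 generations: {mendelian[-1]:.3f}")
    print(f"Homing drive after 10 generations:          {drive[-1]:.3f}")
```

With `e = 0` the recursion reduces to ordinary Mendelian inheritance and the transgene stays at its diluted release frequency; any fitness cost, which this sketch omits, would push it lower. With `e > 0` the frequency rises each generation by `e * q * (1 - q)`.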

Several gene drive mechanisms have been 
proposed, such as transposable elements (88) 
and other gene insertion methods (89), includ- 
ing arboviruses and the homing endonuclease 
system, with various degrees of success. More 
recently, the CRISPR-Cas9 system has emerged 
as a very promising approach. CRISPR-Cas9- 
directed homologous repair of the germ line 
cuts the gene in the wild-type chromosome 
of heterozygous mosquitoes and repairs it by 


homologous repair with the chromosome con- 
taining the effector gene of interest (90). A 
great variety of genes have been proposed as 
insertion targets, including expression of recom- 
binant antibodies that target the pathogen (91) 
or that affect mosquito reproduction (92-94). 
However, a word of caution for constructs af- 
fecting fertility is needed. A partial decrease of 
fertility may ultimately lead to more rather than 
fewer mosquitoes because it will reduce competi- 
tion for resources during larval stages. Emerging 
adults may also be better nourished and live 
longer, resulting in adults with increased vecto- 
rial capacity (95). This problem can be circum- 
vented by strategies that are deleterious to 
adult stages but allow normal larval develop- 
ment (96). 
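
The caution about partial fertility reduction rests on density-dependent competition among larvae. A minimal numerical sketch, assuming a Ricker-type survival curve (a modeling choice made here for illustration, not one taken from the cited work) and hypothetical parameter values, shows how a smaller egg input can yield more emerging adults when larval habitats are overcrowded.

```python
import math


def emerging_adults(eggs: float, crowding: float) -> float:
    """Adults emerging from `eggs` under Ricker-type larval competition.

    Per-egg survival falls as exp(-crowding * eggs); both the functional
    form and the `crowding` coefficient are illustrative assumptions.
    """
    return eggs * math.exp(-crowding * eggs)


if __name__ == "__main__":
    crowding = 0.002          # hypothetical competition coefficient
    baseline_eggs = 1500.0    # hypothetical egg input of a wild population
    for fertility in (1.0, 0.5, 0.02):
        adults = emerging_adults(baseline_eggs * fertility, crowding)
        print(f"relative fertility {fertility:.2f}: {adults:.1f} adults")
```

In this toy setting adult output peaks when egg input equals `1 / crowding`, so a moderate fertility reduction from an overcrowded baseline raises adult numbers and only a severe reduction lowers them. The sketch does not model the improved nutrition or longer life span of the emerging adults.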

An alternative strategy to modify vectors is 
to introduce commensal microorganisms that 
reduce vectorial capacity (Fig. 3) (97). A Serratia 
strain that quickly colonizes the mosquito gut 
when fed in a sugar solution was recently iden- 
tified that, when engineered to secrete anti- 
Plasmodium effector proteins, greatly reduces 
the ability of mosquitoes to transmit malaria 
(97). Rhodococcus rhodnii, a triatomine endo- 
symbiont, has been modified to express an anti- 
Trypanosoma peptide. This microbe rendered 
Rhodnius prolixus resistant to Trypanosoma 
cruzi, the parasite that causes Chagas disease 
(98). In this system, symbionts are transmitted 
from adults to nymphs by coprophagy. Geneti- 
cally modified R. rhodnii has been used to prepare 
a mixture that is actively ingested by triatomine 
nymphs (99). Another successful development 
has been the exploitation of Wolbachia that nat- 
urally infect insects (100). This is an intracellular 
bacterium that is transmitted vertically and 
spreads through natural insect populations by 
inducing cytoplasmic incompatibility (CI). In- 
fected females have viable offspring when they 
mate with either noninfected or infected males, 
whereas Wolbachia-free females are only fer- 
tile if they mate with uninfected males. When 
A. aegypti mosquitoes—the vectors of dengue, 
Zika, and Chikungunya—are infected with some 
strains of Wolbachia, this greatly reduces vi- 
ral replication and transmission (101). This 
strategy is already undergoing field trials in 
several countries to disrupt transmission of 
arboviruses (102, 103). A Metarhizium fungus 
genetically modified to express an insect toxin 
was very effective in suppressing natural pop- 
ulations of mosquitoes highly resistant to in- 
secticides (104). A different strategy, where the 
antimalarial atovaquone was applied to sur- 
faces with which mosquitoes came into contact, 
disrupted Plasmodium development, providing 
an additional tool to prevent malaria transmis- 
sion (105). 
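
The cytoplasmic incompatibility rule described above, and the reason it lets Wolbachia spread, can be written down in a few lines. The following is a sketch under simplifying assumptions that are not from the cited studies: random mating, equal infection frequency in both sexes, complete CI, perfect maternal transmission, and no fitness cost of infection. Function names and the starting frequency are hypothetical.

```python
def cross_is_fertile(female_infected: bool, male_infected: bool) -> bool:
    """Cytoplasmic incompatibility rule: only the cross between an
    uninfected female and an infected male fails."""
    return female_infected or not male_infected


def next_infection_frequency(p: float) -> float:
    """Infected fraction of the next generation.

    Illustrative assumptions: random mating, equal infection frequency in
    both sexes, complete CI, perfect maternal transmission, and no
    fitness cost of infection.
    """
    # Infected mothers are fertile with any male; offspring are infected.
    from_infected_mothers = p
    # Uninfected mothers are fertile only with uninfected males.
    from_uninfected_mothers = (1 - p) * (1 - p)
    return from_infected_mothers / (from_infected_mothers + from_uninfected_mothers)


if __name__ == "__main__":
    for female in (True, False):
        for male in (True, False):
            fertile = cross_is_fertile(female, male)
            print(f"female infected={female}, male infected={male}: fertile={fertile}")

    p = 0.2  # hypothetical infected fraction just after a release
    for generation in range(1, 6):
        p = next_infection_frequency(p)
        print(f"generation {generation}: infected fraction {p:.3f}")
```

Because the only failed cross removes offspring of uninfected mothers, the infected fraction rises every generation in this idealized model. Real releases must also overcome fitness costs and imperfect transmission, which are left out here.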


Future directions 


Most blood-feeding arthropods eliminate the 
pathogens they ingest and do not transmit dis- 
ease. However, there are extremely rare, fortu- 
itous encounters in which a pathogen interacts 
with vector factors that allow it to survive and/or 
evade elimination by the vector immune sys- 
tem and infect a new host when the vector takes 
the next blood meal. This ability to jump or fly 
from one host to another gives these patho- 
gens an enormous advantage, with devastating 
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Fig. 3. Next-generation strategies to disrupt pathogen transmission by arthropod vectors include suppression of vector populations or elimination of the 
pathogens in the vector. Effective entomopathogenic fungi expressing insect toxins or genetic modifications deleterious to specific stages of the vectors have 
been developed to suppress vector populations. Pathogens can be eliminated by genetically modified arthropods or commensal bacteria expressing effectors against 
the pathogen; by introducing a commensal, such as Wolbachia, that creates a physiological state nonpermissive to the pathogen; or by exposing the vector to 
drugs that target the pathogen, such as contact of mosquitoes with antimalarial drugs that eliminate Plasmodium parasites. 
consequences for human and animal health.
There is growing consensus that disrupting 
disease transmission by arthropod vectors is 
critical for disease eradication. As we continue 
to harness our growing understanding of the 
biology of pathogen transmission, we uncover 
critical interactions that can be targeted to 
prevent disease. New translational tools are con- 
stantly being developed that may allow us to 
witness the long-awaited eradication of some of 
the most devastating vector-borne diseases. 
INTRODUCTION: Aging is the progressive decline
in physical, mental, and reproductive capaci- 
ties that is accompanied by multiple morbid- 
ities and associated with increased mortality. 
Despite advances in identifying aging path- 
ways and drugs that extend life span in model 
systems, an integrative understanding of the 
interplay between genetics, sex, and environ- 
ment in aging and life-span determination 
is largely lacking. 


RATIONALE: Although there is no single measure
of aging, studies depend on surrogate or
related traits such as longevity, life history,
age-related disease onset, and physiological
markers. Characterizing genetic and non- 
genetic determinants of longevity at the popu- 
lation level may identify genes and pathways 
involved in aging, providing avenues for tar- 
geted anti-aging therapies and extension of 
healthy longevity. We set out to query genetic 
regulators of longevity in a total of 3276 
UM-HET3 mice used in longevity studies by
the National Institute on Aging’s Interventions 
Testing Program (NIA ITP). We interrogated 
whether the genetic basis of longevity is sex 
and age dependent, and whether nongenetic
factors, such as the effect of early
access to nutrients on growth, contribute to
longevity determination. We characterized the 
age- and genotype-dependent changes in liver 
gene expression in mice from the same genetic 
cross. Finally, we integrated these results with 
orthogonal datasets to obtain a resource of 
prioritized candidate genetic loci and genes 
for further investigation. 


RESULTS: When jointly analyzing males and 
females, we obtained a single, previously 
described, longevity locus on chromosome 
12. However, when analyzed separately, males 
and females had distinct genetic determi- 
nants of longevity. In females, a single locus 
on chromosome 3 was uncovered, whereas 
in males, loci were detected only when early 
deaths were excluded, suggesting that some 
genetic variations had an effect on longevity 
beyond a certain age. Increased body weight
was associated with earlier death, and some of the
variation in adult body weight is explained
by litter size. Hence, early access to nutrients
may affect mouse longevity through its effect 
on growth. We used Mendelian randomization 
to replicate the relationships between early 
growth, adult size, and longevity in humans. 
To prioritize genes under the longevity loci, 
we profiled liver gene expression of adult and 
old mice to look for sex-, age-, and genetically 
driven differences in expression. Female livers 
had higher interferon-related gene expression, 
and older mice had overexpressed immune- 
related genes. Genetic regulation of gene 
expression was assessed, with the majority 
being conserved across sexes and age. We 
combined our results with data from multiple 
sources in model organisms and humans to 
compile an interactive resource for conserved 
longevity gene prioritization
(https://www.systems-genetics.org/itp-longevity). Worm life-span
experiments validated some of the most
highly scoring genes and identified Hipk1, 
Ddost, Hspg2, Fgd6, and Pdk1 as candidates. 


CONCLUSION: This study provided insights into 
determinants of longevity, highlighting genetic 
mediators that can be sex or age specific, and 
nongenetic effects such as early access to nu- 
trients. The combined body of information as- 
sembled from this study and the external data 
constitute a hypothesis-building resource for 
future studies on, and therapies for, aging, age- 
related disease, and longevity. 
DNA variants that modulate life span provide insight into determinants of health, disease, and aging.
Through analyses in the UM-HET3 mice of the Interventions Testing Program (ITP), we detected a 
sex-independent quantitative trait locus (QTL) on chromosome 12 and identified sex-specific QTLs, some 
of which we detected only in older mice. Similar relations between life history and longevity were 
uncovered in mice and humans, underscoring the importance of early access to nutrients and early 
growth. We identified common age- and sex-specific genetic effects on gene expression that we 
integrated with model organism and human data to create a hypothesis-building interactive resource of 
prioritized longevity and body weight genes. Finally, we validated Hipk1, Ddost, Hspg2, Fgd6, and Pdk1 as 
conserved longevity genes using Caenorhabditis elegans life-span experiments. 


Aging is a time-dependent functional
decline in molecular, cellular, and organismal
homeostasis that is affected by
multiple environmental and genetic
factors (1). Genetic studies, such as loss
or gain of function or forward genetic screens, 
have identified many of its conserved genes 
and mechanisms (2, 3). From the metabolic 
perspective, nutrient-sensing pathways such 
as the growth hormone and insulin-like growth 
factor axis (GH/IGF1), target of rapamycin 
(TOR), adenosine 5′-monophosphate (AMP)-
activated protein kinase (AMPK), and sirtuins
control aging or affect longevity (1, 3). In addition
to metabolic decline, other features
such as loss of proteostasis, increased genome
instability, changes in epigenetic marks, and
alterations in telomere length have been proposed
to promote aging (1, 4, 5). Here we used
an unbiased genetic and molecular cross-species 
approach to highlight drivers of aging, starting 
with genetic mapping of loci that modulate 
variation in life span in a large multicenter
and multiyear four-way intercross of male and 
female untreated control mice from the Inter- 
ventions Testing Program (ITP). 


Multicenter and multiyear analysis identifies 
sex- and age-specific longevity loci 


The ITP was funded by the National Institute 
on Aging starting in 2004 to test putative 
positive effects of dietary and drug interven- 
tions on life span (healthy aging) (6, 7). For 
each intervention, a large and sex-balanced 
subcohort of control animals was aged until 
close to natural death along with the treated 
cases (8). All controls and cases were from a 
four-way intercross made by mating BALB/cByJ ×
C57BL/6J F1 females (JAX Stock #100009) to
C3H/HeJ × DBA/2J F1 males (JAX Stock
#100004), the so-called UM-HET3 cross. Animals
were bred at three independent testing sites
(Jackson Laboratory, University of Michigan,
and University of Texas Health Science Center
at San Antonio) using F1 animals from the
Jackson Laboratory. The combination of a high 
level of genetic heterogeneity, very large family 
sample size, and balanced replication across 
sites has led to an unprecedented resource for
longevity research (9). Tail biopsies were systematically
collected from almost all mice, thus
enabling the genotyping and subsequent genetic 
analyses presented here (8) (Fig. 1A). 

We generated genotype data on 3276 mice, 
including 2356 control and 920 drug-treated 
mice (fig. S1, A and B, and data S1). The treated 
cases included animals subjected to 20 differ- 
ent drugs that had no population-level influ- 
ence on longevity; this complexity renders the 
statistical inference of genetic and drug ef- 
fects more challenging, and we therefore used 
this group in combination with the controls in 
supplementary quantitative trait locus (QTL)
analyses. The mice form a genetically
heterogeneous panel with a wide range of
variation in longevity and body weight, but
cohort (year of birth) and site also affect
body weight and longevity (most prominently
in males) despite all efforts of standardization
(10) (fig. S1, C and D). To avoid biases due to 
these nongenetic effects, we accounted for 
center and year in all analyses. 

As previously shown in the UM-HET3 cross 
(10), females have a higher median life span
than males (female median = 886 days, 95% 
confidence interval (CI) [871-897], male me- 
dian = 836 days, 95% CI [816-851]), with the 
largest difference in the survival curves attrib- 
utable to increased early mortality in males 
between ages of about 300 and 800 days. 
Between 800 and 1000 days, females have 
a higher rate of mortality (fig. S1B). After
1000 days, mortality rates are similar, indicating
similar hazard rates at later ages and
similar maximum longevity (fig. S1, A to C). 

Given such age- and sex-specific hazards, we 
asked whether segregating loci affect life span 
in both sexes at all ages or whether sex- and 
age-specific effects are predominant (fig. S2).
To determine whether genetic loci are sex 
specific, we performed four QTL mapping 
analyses: for each sex separately, and for the 
combined dataset with and without a sex-by-genotype
interaction term. To accommodate
the multicenter and multiyear study design,
we used mixed-effects Cox models with site
and cohort as random effects and sex as a
fixed effect when appropriate. To assess the
statistical significance of the results, we cal- 
culated empirical p values for the associations 
based on permutations (8) and characterized 
QTLs with p < 0.2. 
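
The permutation step can be illustrated with a minimal sketch. It does not reproduce the mixed-effects Cox models used in the study: as a stand-in, it assumes a toy statistic (the absolute difference in mean life span between two genotype groups), and every name and data value below is hypothetical. In a genome scan, the null distribution is typically built from the maximum statistic across all markers in each permutation; the single-marker version here only shows the principle.

```python
import random


def mean(values):
    return sum(values) / len(values)


def group_difference(lifespans, genotypes):
    """Absolute difference in mean life span between genotype groups 0 and 1."""
    g0 = [l for l, g in zip(lifespans, genotypes) if g == 0]
    g1 = [l for l, g in zip(lifespans, genotypes) if g == 1]
    return abs(mean(g0) - mean(g1))


def empirical_p_value(lifespans, genotypes, n_permutations=1000, seed=0):
    """Share of label permutations whose statistic reaches the observed one."""
    rng = random.Random(seed)
    observed = group_difference(lifespans, genotypes)
    shuffled = list(genotypes)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if group_difference(lifespans, shuffled) >= observed:
            hits += 1
    # The add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)


# Hypothetical life spans (days) for two genotype groups (toy data).
lifespans = [700, 720, 750, 760, 780, 900, 910, 930, 950, 980]
genotypes = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
p = empirical_p_value(lifespans, genotypes)
print(p)
```

A locus would then be reported when its empirical p value falls below the chosen threshold (0.05, or the relaxed 0.2 used for characterization).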

A genome scan including both sexes identi- 
fied a single QTL at chromosome 12, which 
may be a previously described locus in the 
UM-HET3 and BXD cohorts (Fig. 1B, data S2, 
and table S1) (11, 12). We also detect one significant
female-specific locus at chromosome
3 (p = 0.006) but no significant male-specific
loci. There is no evidence for sex-by-haplotype
interaction, which may be related to power 
limitations due to sample size. 

Early male mortality raised the possibility 
that some early pathologies or behavioral fac- 
tors, possibly genetically driven, may mask 
genetic effects on longevity. Previously, this 
potential confounding was dealt with through 
removal of data from mice with early deaths, 
and this strategy has proven to be useful in 
identifying longevity-associated QTLs (11). We
therefore performed successive QTL scans by 
removing increasing proportions of early deaths 
(Fig. 1C). To define peaks in these scans based 
on smaller subsets of the mice, we also relaxed 
the significance thresholds from 0.05 to 0.2. This 
approach revealed that the female chromosome 3 
Fig. 1. QTL mapping of longevity identifies sex- and age-specific loci.
(A) General scheme of the UM-HET3 four-way intercross and the end points
examined in this study. The UM-HET3 mice come from a cross of a female
(BALB/cByJ × C57BL/6J) F1 and a male (C3H/HeJ × DBA/2J) F1. The
3276 mice in this study are either controls (n = 2356) or were treated with
20 drugs that had no population-level effect on longevity (n = 920, data S1), 
from the NIA ITP conducted at two different sites (UT and UM) and spanning 
years 2004 to 2011. Natural life span, body weight, and DNA were collected 
from these mice. A separate UM-HET3 cohort (Glenn Center, UM) was used to 
obtain livers from young and old males and females for RNA sequencing to 
study sex- and age-related gene expression changes and haplotype-specific 
expression. (B) (Left) Distribution of female and male life spans. (Right)
QTL mapping results for longevity in four analyses. The “Both, no sex int”
model includes all mice with sex as a fixed effect and site as a random effect and
cohort year as a nested random effect within site. The “Both, sex int” model tests for
sex-by-haplotype interactions. The Male and Female models are restricted to one 
sex. Solid and dashed horizontal lines indicate permutation-based significance 
thresholds for α = 0.05 and 0.2, respectively. (C) QTL scans for life span performed
after excluding different percentiles of mice that die early. (Left) Distribution of life- 
span values after exclusion of mice. The labels indicate the minimal life span in days 
and the number of mice. (Right) Overlaid QTL scans, colored by truncation level for 
females and males. Solid and dashed horizontal lines indicate permutation-based 
significance thresholds as in (B). (D) Median longevity of mice stratified by haplotype at 
the male loci on chromosomes 2 and 10, the female locus on chromosome 3, and the
chromosome 12 locus in males and females, after exclusion of different proportions of early
deaths. Alleles B, C, H, and D correspond to CBy, B6, C3, and D2 strains, respectively. 
QTL is robust to removal of early deaths: its
significance decreased only slightly (empirical
p value = 0.006 and 0.081 when 0 and 80% of mice
are removed, respectively), even when only the
longest-lived 20% of female mice were retained. One
female-specific locus emerged on chromosome 
19 only in the 20% of longest-lived females. 
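
The truncation procedure itself is simple to express. The sketch below is only an illustration under stated assumptions: the function name and the toy life spans are hypothetical, and the study applied the exclusion within each sex before rerunning the QTL scan, which is not shown here.

```python
def exclude_early_deaths(lifespans, fraction):
    """Drop the given fraction of shortest-lived animals; return the rest, sorted."""
    if not 0 <= fraction < 1:
        raise ValueError("fraction must be in [0, 1)")
    ordered = sorted(lifespans)
    n_removed = int(len(ordered) * fraction)
    return ordered[n_removed:]


# Hypothetical life spans in days (toy data, not from the study).
lifespans = [310, 450, 620, 700, 810, 850, 880, 905, 990, 1100]
for fraction in (0.0, 0.2, 0.8):
    kept = exclude_early_deaths(lifespans, fraction)
    # Report the truncation level, the number of animals kept,
    # and the minimal life span among them.
    print(fraction, len(kept), kept[0])
```

Each truncated subset would then be passed to the same mapping procedure, with significance assessed against thresholds recomputed for that subset.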

In contrast to females, multiple male loci 
increased in significance upon exclusion of 
early deaths. Specifically, the most significant 
locus is on chromosome 4 (at 20% truncation, 
empirical p value = 0.017) followed by less sig- 
nificant ones on chromosomes 2, 9, 10, and 12 
(at 60, 20, 80, and 20% truncation, respectively; 
0.05 < p < 0.2). Upon observing the longevity 
distribution of mice carrying different allelic 
combinations at the chromosome 2 locus, we 
could appreciate how allelic effects become 
significant when early-death animals were 
progressively removed (Fig. 1D). Male mice 
carrying the B/H allele combination in the 
chromosome 2 locus do not differ from the 
whole male population (p = 1, Fig. 1D, left 
panel), but become the longest lived as
early deaths are removed (p = 0.137 at 80% truncation).
The age-specific and male-specific chromo- 
some 10 locus (p = 1 and 0.185 at 0 and 80% 
truncation, respectively) overlaps with another 
previously described UM-HET3 longevity locus 
(11). The chromosome 12 locus detected in males 
overlaps with the one that we found in the 
analysis incorporating males and females 
and already identified in UM-HET3 (11). When
examining sex-specific scans, the C/D allele 
combination offers an advantage in males 
(Fig. 1C, lowest empirical p value = 0.06 at 20% 
truncation in males) but not in females (Fig. 1C, 
lowest empirical p value = 0.357, no truncation). 

To further explore whether QTLs overlap 
with known genetic associations, we looked 
for correspondence with known longevity, 
growth, and body size QTLs in the Mouse 
Genome Informatics database (13) (table S1).
Except for the chromosome 10 and 12 loci in 
males, there is no overlap with longevity or 
life-span loci, which is not surprising given 
the general dearth of longevity QTLs likely 
due to limited power of existing studies. Instead, 
we found extensive overlap with known growth 
or body weight loci, pointing to the impor- 
tance of genetic determination of growth rates 
on longevity. 


Interplay between early growth, size, and 
longevity in mice and humans 


The relation between growth, body size, and 
longevity has been extensively studied (10, 14, 15). 
This and our observation of overlap between 
longevity loci and growth QTLs prompted us 
to explore this relation further. At the pheno- 
typic level, we observed an inverse relation- 
ship between early body weight and longevity 
that is more pronounced in males (Fig. 2, A 
and B). For instance, the hazard ratio of a 10-g
increase at 6 months is ~1.8 and ~1.20 in males 
and females, respectively (mixed-effects Cox 
model p values < 0.00001 and 0.0016). This 
relation progressively decreased in magnitude, 
yet remained detectable until 18 months. At 24 
months, the relation disappeared in females 
and reversed in males, suggesting that lower 
body weight at an advanced age (between 18 
and 24 months) may be a predictor of earlier 
mortality in males as they lose weight prior 
to death. These observations are not only in 
agreement with studies in UM-HET3 mice 
(10, 14) but also with findings in humans, in 
which higher childhood weight is associated 
with higher morbidity and mortality, especially 
in males (16, 17), and weight loss at older ages is
associated with increased frailty and mortality 
risk (18, 19). 
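
To make the hazard ratios above easier to interpret, recall that in a Cox model with a covariate entered linearly, the hazard ratio for an increase of d units is exp(beta × d). The sketch below rescales the reported per-10-g values to a per-gram scale. It assumes that body weight enters the model as a single linear term, which is an assumption of this illustration and not a detail confirmed by the text; the function name is hypothetical.

```python
import math


def rescale_hazard_ratio(hazard_ratio, from_units, to_units):
    """Convert a hazard ratio per `from_units` of a covariate to one per `to_units`."""
    beta_per_unit = math.log(hazard_ratio) / from_units
    return math.exp(beta_per_unit * to_units)


# Approximate per-10-g hazard ratios at 6 months quoted in the text.
hr_male_per_gram = rescale_hazard_ratio(1.8, from_units=10, to_units=1)
hr_female_per_gram = rescale_hazard_ratio(1.20, from_units=10, to_units=1)
print(hr_male_per_gram, hr_female_per_gram)
```

Under that assumption, each additional gram at 6 months corresponds to roughly a 6% higher hazard in males and a 2% higher hazard in females.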

Before exploring the genetic determinants 
of body weight variation, we characterized the 
effects of nongenetic factors on body weight 
and whether they can contribute to variation 
in longevity in the UM-HET3 mice. Because 
the differences in body weight can be attributed 
to genetic and environmental factors, we 
wondered whether early access to nutrients as 
a result of litter size differences could affect 
growth, and consequently adult body weight 
and longevity (20). We used litter size infor- 
mation of a subset of 1575 control mice aged at 
the University of Texas, San Antonio. Indeed, a 
significant negative effect of litter size on male 
and female body weights at 6, 12, 18, and 
24 months of age was observed, indicating 
long-lasting effects of early growth due to 
differential access to nutrients (p < 0.05 
for female 24-month body weight, p < 0.001 
for the rest; fig. S3, A and B). Furthermore, we 
observed an effect of litter size on longevity, 
apparently acting through body weight, as 
smaller litter size is associated with higher 
weight, which, in turn, is associated with 
shorter life span (8) (Fig. 2C). We therefore 
investigated whether including body weight 
and litter size as covariates in a longevity 
QTL scan would affect the results. Life span 
was remapped for males and females sepa- 
rately, without body weight, with body weight 
at 6 months (an age at which the relation is 
strongest), and with body weight at 6 months 
and litter size (fig. S3C). This analysis was 
performed for a total of 1122 control mice of 
both sexes, which resulted in similar link- 
ages to the analysis excluding litter size as a 
covariate, albeit with reduced significance 
(highest peak in females is at chromosome 3 
with empirical p value = 0.399). Logarithm 
of the odds (LOD) scores and significance 
are reduced by inclusion of body weight only 
in males. The inclusion of litter size in ad- 
dition to body weight has little to no effect, in 
agreement with the mediation analysis in 
which the effect of litter size on longevity is 
mediated by body weight. Although the individual
findings and pairwise relationships
between litter size-growth and growth-longevity 
are not novel, here we provided an integrated 
view in a large and genetically heterogeneous 
mouse population. 

We next asked if these relations are specific 
to UM-HET3 mice and whether we could es- 
tablish similar associations in humans. We ap- 
plied a Mendelian randomization (MR) strategy 
to large human datasets. Specifically, we assessed 
whether early and adult body size or height 
relate to parental longevity (a proxy of the 
individual’s longevity) and indeed, early body 
weight at 10 years, height at 10 years, adult 
body mass index (BMI), but not adult height 
have potential negative effects on longevity 
(Fig. 2D) (21-23). An interesting finding is
that the early body size effect is completely 
mediated by adult BMI (Fig. 2E). Height, by 
contrast, has a more complicated relationship 
with longevity, likely because it is also asso- 
ciated with higher socioeconomic status, which 
itself extends life span, and may be affected by 
population stratification (Fig. 2F) (24). Faster 
early growth may therefore be detrimental, 
but adult height may have antagonistic effects 
on longevity as it is jointly affected by growth 
and socioeconomic status. 
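
The inverse variance weighted (IVW) estimator named in the legend of Fig. 2D can be sketched as follows. This is a minimal fixed-effect version under stated assumptions: the per-variant effect sizes are invented for illustration, the function name is hypothetical, and the study may have used additional MR methods and sensitivity analyses that are not shown.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance-weighted causal estimate and standard error."""
    weights = [1.0 / se ** 2 for se in se_outcome]
    numerator = sum(
        w * bx * by for w, bx, by in zip(weights, beta_exposure, beta_outcome)
    )
    denominator = sum(w * bx ** 2 for w, bx in zip(weights, beta_exposure))
    return numerator / denominator, math.sqrt(1.0 / denominator)


# Hypothetical per-variant effects on the exposure (e.g., early body size)
# and on the outcome (e.g., parental life span); not real summary statistics.
beta_exposure = [0.10, 0.20, 0.15, 0.05]
beta_outcome = [-0.020, -0.041, -0.029, -0.011]
se_outcome = [0.010, 0.012, 0.011, 0.009]

estimate, standard_error = ivw_estimate(beta_exposure, beta_outcome, se_outcome)
print(estimate, standard_error)
```

A negative estimate, as in this toy input, would correspond to the direction reported for early body size and adult BMI.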

To further deconstruct this, we performed 
an MR analysis for early and late height while 
excluding shared instrumental variables; that 
is, we used genetic instruments that are specific 
to each stage (Fig. 2G). Notably, the opposing 
effects become larger and more significant, 
supporting the hypothesis of distinct antag- 
onistic age-specific effects of early and adult 
height on longevity. Finally, we analyzed 
whether the early height’s effect is mediated 
by adult BMI in a manner similar to early 
weight through MR mediation and found that 
this is not the case, supporting a distinct rela- 
tionship between early weight and height and 
longevity (Fig. 2H). An analogous investiga- 
tion is not possible in our mouse population, 
as only body weight is available and not length 
or adiposity. Taken together, the interplay be- 
tween different life-history traits that is similar 
between mice and humans shows that determi- 
nants of longevity can be decomposed into 
distinct genetic factors acting on other life- 
history traits, as well as nongenetic factors 
such as early growth, which may be controlled 
by early access and use of nutrients. 


Analysis of body weight QTLs in relation to 
the longevity QTLs 


Given the conserved age-dependent relation- 
ship between longevity and body weight across 
sexes and that this is also the case in humans, 
we explored whether variation in these traits 
is explained by the same genetic alleles. If that 
is the case, one explanation would be that 
the same genes may be universally affecting 
Fig. 2. Interplay between early growth, size, and longevity in mice and
humans. (A) Inverse relationship between body weight and longevity is stronger 
in males than in females and at younger ages. Body weight trajectories of UM-HET3 
mice, separated by sex. Each line represents body weight evolution of a single mouse. 
Body weight measurements were performed at 6, 12, 18, and 24 months, which are 
marked by vertical dashed gray lines. Mice are split into seven groups and colored 
on the basis of their longevity. Violin plots show the distribution of body weight 
at the measured ages. (B) Mixed-effects Cox model hazard ratio of an increase 
in 10 g in body weight in males and females, as well as the 95% confidence interval 
and p value (8). (C) Effect of litter size on survival mediated by body weight at 
different ages using mediation analysis (b indicates effect size, and n is the number
of instruments). (D) Inverse variance weighted Mendelian randomization results
for body weight and height and parental life span (22). Early body size and heights 
are the comparative body weight and heights at age 10 of ~500,000 individuals 
from the UK Biobank (data fields 1687 and 1697, respectively). BMI (body mass index) 
is from ~700,000 individuals (23). (E) MR-Mediation analysis shows that the 
effect of early body weight is completely mediated by BMI. (F) MR-Mediation shows 
that the effect of early height is not mediated by height. (G) Because adult and 
early height share genetic associations (or instrumental variables, IVs), MR was 
performed with the IVs that are specific to each phenotype, excluding the common IVs, 
further enhancing the opposing effects of early and adult height. (H) MR-Mediation 
shows that the effect of early height is not mediated by BMI. 


different facets of aging, for which body weight 
and longevity act as surrogate traits. For that, we 
performed QTL mapping in the UM-HET3 mice 
with a total of four models, as we had done for 
longevity (Fig. 3A, and fig. S4A). 
We detected sex-specific body weight QTLs
that are conserved at different ages. For instance,
we identified a female body weight
QTL at chromosome 7 at all ages, and one at
chromosome 6 that is significant until 18 months
(p < 0.05). Some other QTLs were also conserved
in at least two different ages such as those on
chromosomes 5, 6, and 15 in females and 9 and
17 in males. Indeed, a previous QTL study in 
UM-HET3 combining two sexes had already 
Fig. 3. QTL mapping of body weight identifies sex-specific loci and evidence of shared genetic
effects on longevity in males. (A) QTL mapping results for sex-specific body weight at
different ages and longevity scans at different exclusion levels. Each locus is colored by
the -log10(empirical p value), and QTL peaks are marked with a black line [see (8) for peak
definition]. The peak location of each locus is marked by a white dot. Red arrows mark the two
loci on chromosomes 9 and 10 where QTL signals for longevity and body weight colocalize.
(B) Scatterplot of the LOD scores at each marker for longevity and body weight at 6 months on
the vertical and horizontal axes, respectively. Each marker is labeled and colored by the
chromosome on which it resides. Circles show the markers that were further examined in (C)
and (D). (C) Effect plots for body weight at 6 months by genotype, as well as survival
curves, at the chromosome 9 and 10 loci in males (excluding 20 and 80% of early deaths,
respectively). In the effect plot, the mean body weight and standard deviations per genotype
are represented, as well as the difference between the highest and lowest averages. The
dashed red lines are visual aids to connect the longevity and body weight QTL colocalizations
in (A) and the allelic effects in (C). (D) Mixed-effects Cox model of the association of
genotype and body weight as a function of different truncation proportions (nine truncation
levels from 0 to 0.8). Points are drawn when the p value is less than the
Bonferroni-corrected p-value threshold (0.05/9 = 0.0056). B/H is the reference genotype to
which all others are compared. The mediation analysis is performed at the indicated
truncation levels (0.2 and 0.8 for chromosome 9 and 10 loci, respectively) to match the
longevity association results.
identified the loci on chromosomes 6, 7, and
15 (14) that also agree with the findings here in
terms of directionality and magnitude (Fig.
3B). Notably, these earlier results were obtained
at one site, whereas the current data
are from two sites across many years. With the
exception of a locus on distal chromosome 9 
for male and female body weight, which also 
overlaps with the male longevity QTL, there 
were no instances of the same QTL emerging 
in both sexes (fig. S4A), indicating that the 
genetic effects on body weight are predominantly 
sex specific. In sex-combined QTL scans, some 
QTLs overlap with existing sex-specific QTLs 
(chromosomes 10 and 17 for males and 5, 6, 7, 
and 15 for females), and some additional QTLs 
are evident on chromosomes 2 and 11. In 
terms of sex-by-genotype interaction, we iden- 
tified loci on chromosomes 7 and 15. We also 
performed QTL scans for body weight change 
that yielded a single QTL on chromosome 4
in males and QTLs on chromosomes 5, 6, and 7 in females,
overlapping with body weight QTLs (fig. S4A).
These results do not support the presence of 
genetic modifiers of rates of body weight 
changes in adult mice that are distinct from 
the determinants of body weight. 

Although we identified significant QTLs in 
most analyses, this ensemble of QTL mapping 
results for longevity, body weight, and body 
weight change suggests that there is little QTL 
sharing across sexes and traits (Fig. 3A; fig. S4, 
A and B; and data S2 and S3). There are no 
overlaps in females, whereas in males, the 
chromosome 4 and 9 loci and the age- and 
male-specific longevity locus at chromosome 
10 overlap with body weight loci. Indeed, 
when we observed the genome-wide relation- 
ship between the LOD scores for longevity and 
body weight QTL scans, we saw that they cor- 
relate in males but not in females, paralleling 
the strong relationship at the phenotypic level 
(Pearson correlation in males = 0.339, p = 2.33 x 
10°”, Fig. 3B). The male-specific colocaliza- 
tion may either be the result of the two traits’ 
correlation or due to genetic effects on longevity 
being mediated by genetic regulation of body 
weight. To distinguish between these two pos- 
sibilities, we examined the effect of each genotype 
on longevity and body weight and performed 
mediation analyses. 

We found no relationship between the genetic 
association with body weight and longevity on 
chromosome 4 (fig. S4C). That is, mice carrying
the lightest allele are not the longest-lived or 
vice versa. This is an indication that body weight 
modulation and longevity are distinctly affected 
by genetic variants in this locus. By contrast, 
male mice with the B/D combination on the 
distal chromosome 9 locus are the heaviest 
at 6 months and the shortest-lived (Fig. 3C). 
Similarly, mice carrying the C/H combination at the
chromosome 10 longevity locus (when 80% of early deaths are
excluded) are the lightest at 6 months and the
longest-lived (Fig. 3C). These patterns on
chromosomes 9 and 10 raise the possibility that the
effect of genetic variation on longevity is through 
modulation of body weight or growth. 

To further examine these two loci, we tested 
whether including body weight in a Cox model 
would extinguish the genetic effect we observe 
on longevity (Fig. 3D). For chromosome 9, we 
observed that body weight, and not genotype, 
is significantly associated with hazard (p = 
6.15 x 10"). In addition, a mediation analysis 
illustrates that the effect of genotype on longevity 
is predominantly mediated by body weight. 
The chromosome 10 male locus, however, shows 
signs that the genetic effect on longevity is 
independent of the effect on body weight. The
C/H genotype combination is significantly 
associated with male longevity when at least 
70% of early deaths are removed even when 
controlling for body weight (p = 0.0021). In 
addition, the mediation analysis shows that 
the effect of genotype is direct and not through 
body weight. Taken together, these results de- 
signate the loci on chromosomes 4 and 10 as 
male-specific longevity modifiers through an 
unknown mechanism, whereas a locus on chro- 
mosome 9 may be modulating longevity through 
its effect on growth or body weight. 
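
The logic of separating a direct genetic effect from one mediated by body weight can be sketched with a simplified linear model. This is not the survival-based analysis used in the study: it assumes a continuous outcome fitted by ordinary least squares, a binary genotype code, and invented data, and all function names are hypothetical. It decomposes the total effect of genotype on life span into a direct part and an indirect part that passes through body weight.

```python
def _mean(values):
    return sum(values) / len(values)


def _cov(x, y):
    mx, my = _mean(x), _mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)


def simple_slope(x, y):
    """OLS slope of y on x (with intercept)."""
    return _cov(x, y) / _cov(x, x)


def two_predictor_slopes(x, m, y):
    """OLS slopes of y on x and m (with intercept), in closed form."""
    sxx, smm, sxm = _cov(x, x), _cov(m, m), _cov(x, m)
    sxy, smy = _cov(x, y), _cov(m, y)
    det = sxx * smm - sxm ** 2
    return (smm * sxy - sxm * smy) / det, (sxx * smy - sxm * sxy) / det


def mediation(x, m, y):
    """Total, direct, and indirect (a * b) effects of x on y through m."""
    a = simple_slope(x, m)                    # exposure -> mediator
    direct, b = two_predictor_slopes(x, m, y)  # exposure and mediator -> outcome
    return {"total": simple_slope(x, y), "direct": direct, "indirect": a * b}


# Toy data: genotype code, body weight at 6 months (g), life span (days).
genotype = [0, 0, 0, 0, 1, 1, 1, 1]
body_weight = [30, 32, 31, 33, 38, 40, 39, 41]
life_span = [900, 880, 895, 870, 800, 790, 805, 775]

result = mediation(genotype, body_weight, life_span)
print(result)
```

In this linear setting the total effect equals the sum of the direct and indirect effects. A locus behaving like the one on chromosome 9 would show a large indirect share, whereas one behaving like the chromosome 10 locus would show a mostly direct effect.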


Liver gene expression reveals age- and 
sex-related differences in gene- and 
haplotype-specific expression 

Having identified several QTLs for longevity 
and body weight, we next explored the genes 
under the QTL peaks in an effort to shortlist 
candidate longevity genes. Because the identi- 
fied QTLs are large owing to limited recombi- 
nation in a four-way intercross, there may be 
dozens of genes under each QTL. Further- 
more, genetic variation within or around these 
genes may affect the gene’s function either 
directly or through regulatory variation (25). 
We devised multiple strategies to home in on 
candidate causal genes within these loci, none 
of which provides enough confidence in isola- 
tion. For instance, endophenotypes such as 
gene expression allow a more fine-grained 
investigation of genetic effects on quantitative 
traits and typically have a simpler genetic 
architecture. We performed RNA sequencing 
(RNA-seq) of 33 livers from adult (6 month) 
and old (22 month), male and female UM-HET3 
mice to identify genes with allele- and haplotype- 
specific expression (HSE). 

Prior to HSE analysis, we characterized the 
effects of sex and age by performing differen- 
tial gene expression for age, sex, and age-by-sex 
interaction, followed by gene set enrichment 
analysis (8) (fig. S5A and data S4). Although 
most of these age-related changes in gene ex- 
pression are not expected to be causal (26), they 
provide insights into the footprint of aging 
(27, 28). Differential expression in livers of 
UM-HET3 mice has already uncovered genes 
whose expression is related to life span-extending 
interventions (29). However, it is not clear if 
differentially expressed genes are more likely 
to explain interindividual differences in aging 
or longevity than genes whose expression remains 
stable with age. There were 444 sex- and 
190 age-related differentially expressed genes 
(absolute log2 fold change > 1, BH-adjusted p < 
0.05). The most significant functional enrichment 
for the sex effect is in the “cellular response 
to interferon-β” gene ontology term, with genes 
expressed higher in females (adjusted p = 0.047; 
Fig. 4, A and B). The interferon response is not 
only sexually dimorphic and higher in female 
mice (30) but also a major determinant in 
mediating severity of infections, including 
COVID-19, likely explaining higher mortality 
in men who have lower type I interferon activity 
than women (31, 32). It is therefore possible 
that differences in interferon response may play 
a contributing role in the sex- and age-specific 
differences in disease susceptibility. 
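
The thresholds quoted above (absolute log2 fold change > 1 and BH-adjusted p < 0.05) can be illustrated with a minimal sketch. It assumes per-gene fold changes and raw p values are already available; the gene names and numbers are hypothetical toy values, not data from this study, and the function names are illustrative only.

```python
def bh_adjust(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def call_de_genes(results, lfc_cutoff=1.0, alpha=0.05):
    """results: list of (gene, log2_fold_change, raw_p) tuples.

    Returns the genes passing both the fold-change and adjusted-p filters.
    """
    adj = bh_adjust([p for _, _, p in results])
    return [
        gene
        for (gene, lfc, _), q in zip(results, adj)
        if abs(lfc) > lfc_cutoff and q < alpha
    ]


# Hypothetical toy values (assumption), not results from the paper.
toy = [
    ("geneA", 2.3, 0.0001),
    ("geneB", -1.6, 0.004),
    ("geneC", 0.4, 0.0005),   # significant but fold change too small
    ("geneD", 1.2, 0.30),     # large fold change but not significant
]
print(call_de_genes(toy))
```

Applying both filters together keeps only genes whose change is both statistically supported and large in magnitude.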

As for the effect of age on liver expression, 
we observed very significant enrichment of 
immune-related terms in genes whose expres- 
sion increases with age (adjusted p < 0.05) and 
no significant enrichment in those with de- 
creased expression (Fig. 4A). This is in line 
with previous findings that there are more 
genes that are consistently overexpressed with age 
than are underexpressed and that the overexpressed 
genes tend to be immune related (33). 
Specifically, the most enriched biological proces- 
ses relate to leukocyte activation and prolif- 
eration, possibly reflecting a change in tissue 
composition due to increased number or ac- 
tivity of infiltrating or resident immune cells 
in the aged liver. Indeed, published single-cell 
RNA-seq data indicate that changes in liver 
cellular composition, as well as an increase in 
immune signature, occur with age (34). Specif- 
ically, the relative number of hepatocytes de- 
creases, whereas the number of liver sinusoidal 
endothelial cells (LSECs) increases. By contrast, 
Kupffer cells seem to remain stable in numbers, 
but more of them express Il1b in older mice. To 
corroborate these findings, we performed single- 
cell deconvolution analysis on the bulk RNA- 
seq results (8) (Fig. 4C). We detected changes 
in estimated cell type proportions only in 
males, with significant loss in hepatocyte num- 
ber (Dirichlet regression p value = 0.00024) and 
gain in B cells, Kupffer cells, and LSECs (p = 
0.011, 0.013, 9 x 10~*, respectively), in line with 
single-cell analyses (34). Taken together, the 
age-related changes in gene expression in 
UM-HET3 livers support established functional 
and compositional changes, an indication of 
the conservation of major age-related changes 
across studies. 

Fig. 4. Liver gene expression reveals age- and sex-related differences in 
gene and haplotype-specific expression. (A) Gene set enrichment analysis of 
biological processes in sex- and age-related differences in UM-HET3 liver gene 
expression. A positive value indicates an enrichment of a gene set driven by 
higher expression of its genes in males (in the left panel) or an expression 
increase due to age (in the r [...] 
was first genotyped at every polymorphic locus, then haplotype calling was 
performed. The numbers of maternal and paternal reads were collected and 
used in several binomial generalized linear mixed models to test for the 
effect of haplotype, age, and sex. (E) HSE results for the whole dataset. Within 
each model, genes were divided into whether the parameter’s p value is 
significant (at a BH-adjusted p < 0.05). “Not tested / NA” indicates that 
coefficients could not be estimated for a model component for a certain gene. 
Stacked bar plots indicate the number of genes in each group. Text labels are 
the number of genes. (F) Hierarchical clustering of genetic relatedness of 
parental strains explains differences in the number of detected HSE genes 
per genotype. The more distant the strain pair, the more HSE genes are 
detected (8). (G) UpSet plot of the number and sharing of genes with 
significant HSE across different analysis subsets. 

We also exploited the presence of segregating 
UM-HET3 variants to detect HSE (8) (Fig. 
4, D and E; fig. S5, B to E; and data S5). 
Allelic imbalance is a pervasive phenomenon 
in mice that is affected by parent-of-origin 
effects and regulatory variation, which con- 
tributes to complex traits (35-37). The main 
rationale is that an allelic imbalance under a 
QTL of a complex trait may be causal and 
underlying variation in longevity or body weight. 
The fact that allele-specific expression is directly 
implicated in disease or useful in inferring causal 
mechanism of disease-associated loci provides 
further support to this approach (38). Briefly, 
we inferred genotypes on the basis of the RNA-seq 
data and used a binomial generalized 
linear mixed model to infer whether sex, age, 
or specific haplotype combinations affect the 
ratio of maternal to paternal transcripts or 
allelic imbalance. The data and specific exper- 
imental design allowed us to assess HSE in 
~6000 genes, using ~70,000 variants (Fig. 4E). 
Using this strategy, we attributed the measured 
bias in maternal to paternal reads to age, sex, 
or one of the genotypes. For example, there are 
23 and 62 genes for which a bias in expression 
can be attributed to sex and age, respectively. 
As expected, identified HSE is mostly attributed 
to genotype effects, with B/D having the most 
and C/H the fewest HSE events (596 and 351 
significant, respectively). It is noteworthy that 
the UM-HET3 parental strains are not genetically 
equidistant from each other, with B6 
being the most distant from the three other 
strains and C3 and cBy being the closest pair 
(Fig. 4F). This genetic relatedness, coupled with 
the cross design, implies that some haplotype 
combinations are more likely to have HSE 
than others, simply because of differences in 
the abundance of variants enabling HSE infer- 
ence (Fig. 4, E and F). Specifically, it is more 
likely to detect HSE in genes in mice that are 
carrying the B maternal allele, as the B6 strain 
is genetically the most distant from the three 
other strains. 
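
To make the notion of allelic imbalance concrete, here is a minimal sketch that tests whether maternal and paternal read counts at one gene depart from a 50:50 ratio. It deliberately swaps in a much simpler technique, a per-gene exact two-sided binomial test, for the binomial generalized linear mixed models with haplotype, age, and sex terms used in the study. The read counts and function names are hypothetical.

```python
from math import comb


def binom_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial test.

    Sums the probabilities of all outcomes that are no more likely
    than the observed count k under the null proportion p.
    """
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    cutoff = pmf[k] * (1 + 1e-9)  # tolerance for floating-point ties
    return min(1.0, sum(q for q in pmf if q <= cutoff))


def allelic_ratio(maternal, paternal):
    """Fraction of allele-informative reads that carry the maternal allele."""
    return maternal / (maternal + paternal)


# Hypothetical read counts for one gene in one sample (assumption).
maternal_reads, paternal_reads = 15, 5
ratio = allelic_ratio(maternal_reads, paternal_reads)
p_value = binom_two_sided_p(maternal_reads, maternal_reads + paternal_reads)
print(f"maternal fraction = {ratio:.2f}, p = {p_value:.4f}")
```

A single-gene test like this ignores repeated measures across mice and variants, which is one reason a mixed model is the more appropriate tool for the real data.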

We compared the results of HSE sepa- 
rately on different subsets (males of both ages, 
females of both ages, young adults of both 
sexes, old of both sexes) by finding overlaps 
of genes with HSE (Fig. 4G). The largest set 
of genes (n = 504) is the intersection of all 
subsets and consists of genes that have HSE 
independent of sex or age, likely ones with 
robust determinants in cis. The next largest 
set (n = 184) is male specific, followed by one 
that is specific to old mice (male and female), 
suggesting that some of the observed HSE 
effects may be mediated by sex- or age-specific 
factors. 
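
The overlap counts summarized in an UpSet plot are exclusive intersections: each gene is assigned to the exact combination of subsets in which it shows HSE. A minimal sketch of that bookkeeping follows, using hypothetical gene identifiers and subset names rather than the study's data.

```python
from collections import Counter


def exclusive_intersections(sets_by_name):
    """Count genes per exact combination of subsets containing them."""
    membership = {}
    for name, genes in sets_by_name.items():
        for gene in genes:
            membership.setdefault(gene, set()).add(name)
    return Counter(frozenset(names) for names in membership.values())


# Hypothetical HSE gene sets per analysis subset (assumption).
toy = {
    "males": {"g1", "g2", "g3", "g4"},
    "females": {"g1", "g2"},
    "young": {"g1", "g2", "g3"},
    "old": {"g1", "g2", "g5"},
}
counts = exclusive_intersections(toy)
for combo, n in counts.most_common():
    print(sorted(combo), n)
```

In this toy input, the genes found in all four subsets form the largest group, mirroring how the intersection of all subsets is read from the plot.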


Cross-species search of candidate genes 
implicated in longevity modulation 


Having identified QTLs for body weight and 
longevity, gene expression differences by sex 
and age, and HSE for different ages and sexes, 
we combined all this information to catalog 
and develop a simple scoring scheme for genes 
under QTLs (8) (table S2, Fig. 5, and data S6). 
Because genetic variation can affect a protein- 
coding gene’s function, we examined whether 
genes have high-impact variants (HIVs) that 
may affect their splicing or coding potential 
in one of the parental strains. We further 
determined whether these variants segregate 
in the UM-HET3 in a way that is consistent 
with the observed QTL effect. We postulated 
that genes under the QTL with age-related 
differential expression (DE) could be of inter- 
est in hypothesis building, even if DE may be a 
result and not a cause of aging, so we inves- 
tigated whether the gene is differentially 
expressed in the liver dataset. Finally, to detect 
regulatory variant effects on gene expression, 
which could be driving the QTL signal, we 
asked whether the genes exhibit HSE. 

Going beyond UM-HET3, we expanded the 
search toward other mouse and model orga- 
nism databases. We examined whether QTL 
genes are differentially expressed in all the 
tissues of Tabula Muris Senis (TMS) (34) and 
gave an additional point if they are differen- 
tially expressed in multiple tissues. In addition, 
we extended the search to model organisms and 
cross-referenced the genes and their orthologs 
with the GenAge database, which contains a 
curated list of longevity genes (39). Additional 
points were given if a gene is a known longevity 
gene in mice or in more than one species. 

In addition to analysis of mouse data, we 
also characterized the genes’ human orthologs 
by mirroring the available mouse data gen- 
erated in this study that span the genetic, gene 
expression, and genetically driven gene expres- 
sion (8). First, we checked for age-associated 
DE in all GTEx tissues (40). Additional points 
were given if the gene is differentially expressed 
in more than one tissue. Then we looked at 
whether there is any human genome-wide 
association study (GWAS) hit for longevity 
(or BMI for the mouse body weight QTLs). To 
establish genetic links between gene expres- 
sion and BMI and longevity, we performed 
transcriptome-wide Mendelian randomization 
(TWMR) (41). The combination of all these 
data serves as a hypothesis-building knowledge 
base for longevity and body weight genes. Figure 
5B shows genes with a minimum score of 5 
under the longevity QTLs of females, and males 
with 20% and males with 80% truncation, 
representing early and late determinants of 
male longevity (data S6 includes all the an- 
notations and scores). Although the results of 
this investigation cannot attribute causal roles 
to these genes in longevity modulation, they 
provide valuable candidates for in-depth mechanistic 
studies that capitalize on our systems 
genetics observations. We illustrate this by performing 
survival analyses in C. elegans to test 
whether RNA interference (RNAi)-knockdown 
of the worm orthologs of the top-scoring genes 
would modulate longevity, therefore highlight- 
ing interesting genes for follow-up (Fig. 5C 
and fig. S6). 
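
The scoring idea described above can be sketched as a sum over binary annotation criteria. This is a simplified illustration under stated assumptions: it gives one point per criterion, whereas the actual point values are those shown in Fig. 5A and table S2; the criterion labels are paraphrased for this sketch, and the gene names and annotations are hypothetical.

```python
# One flag per annotation criterion, paraphrased from the text and the
# Fig. 5A legend. The label names are hypothetical (assumption).
CRITERIA = (
    "hiv",
    "hiv_concordant_with_qtl",
    "de_umhet3_liver",
    "hse_umhet3_liver",
    "de_tms_any_tissue",
    "de_tms_multiple_tissues",
    "genage_any",
    "genage_mouse",
    "genage_multiple_species",
    "de_gtex_any_tissue",
    "de_gtex_multiple_tissues",
    "human_gwas_hit",
    "twmr_link",
)


def aggregate_score(annotations):
    """Count satisfied criteria, assuming one point each (assumption)."""
    return sum(1 for c in CRITERIA if annotations.get(c, False))


def shortlist(genes, min_score=5):
    """Return genes with score >= min_score, highest score first."""
    scores = {g: aggregate_score(a) for g, a in genes.items()}
    keep = [g for g, s in scores.items() if s >= min_score]
    return sorted(keep, key=lambda g: (-scores[g], g))


# Hypothetical annotations for two made-up genes (assumption).
toy_genes = {
    "geneX": {
        "hiv": True,
        "hiv_concordant_with_qtl": True,
        "hse_umhet3_liver": True,
        "de_tms_any_tissue": True,
        "de_tms_multiple_tissues": True,
    },
    "geneY": {"hiv": True, "de_gtex_any_tissue": True},
}
print(shortlist(toy_genes))
```

An additive score of this kind ranks candidates for follow-up; it does not by itself establish that a gene is causal.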

In the female longevity locus on chromosome 
3, the five highest-scoring genes are Ctss, 
Ecm1, Fcrl1, Hipk1, and Etv3 (aggregate score = 
5; Fig. 5B and data S6). Among those, we could 
validate the orthologs of Ctss (K02E7.10 and 
Y113G7B.15) and Hipk1 (hpk-1) in the worm 
and find that hpk-1 RNAi significantly reduces 
life span (p < 0.0001; Fig. 5C and fig. S6). Indeed, 
hpk-1 is already annotated as a worm pro- 
longevity gene in GenAge (42), and the tran- 
scriptional cofactor that it encodes has been 
shown to maintain proteostasis and extend 
longevity (43). The ETS Variant Transcription 
Factor 3 (Etv3), which has an HIV with segre- 
gation patterns concordant with longevity and 
is differentially expressed in UM-HET3 and in 
TMS, has not been implicated in mammalian 
aging. However, the Drosophila ETS family 
members, Aop and Pnt, have been found to 
control longevity (44). Similarly, Ctss has HIV, 
HSE, and is differentially expressed in mul- 
tiple TMS tissues, yet has not been implicated 
in mammalian aging. More in-depth studies 
would be required to assess the roles of the 
top-scoring genes. 

For the early male longevity QTLs excluding 
20% of deaths, the top-scoring genes are Alpl, 
Eif4g3, Ece1, Pafah2, Hspg2, and Ddost (aggregate 
score > 5; Fig. 5B and data S6). The highest-scoring 
gene is the tissue-nonspecific or liver, 
bone, kidney-type alkaline phosphatase (Alpl) 
gene that also harbors an HIV, exhibits HSE 
but has a decreasing expression in the UM-HET3 
liver, and is differentially expressed in GTEx 
adipose tissue. More investigation is required 
to confirm this gene and establish a mecha- 
nistic link with longevity. We tested orthologs 
of all genes except Alpl, which does not have a 
worm ortholog, and found that unc-52 (Hspg2) 
and ostb-1 (Ddost) RNAi both significantly shorten 
life span (p < 0.0001; Fig. 5C and fig. S6). Although 
unc-52 is already known to affect worm 
life span (45, 46), ostb-1 or its ortholog (Ddost) 
has not yet been directly linked to longevity. 
However, Dolichyl-diphosphooligosaccharide- 
protein glycosyltransferase noncatalytic sub- 
unit (DDOST) is implicated in processing 
advanced glycation end products (AGEs) (47), 
which accumulate with age and exacerbate the 
aging phenotype (48). In the UM-HET3, the 
gene has HIVs as well as HSE, which may 
affect rates of AGE processing and ultimately 
longevity. 

In the late male longevity QTLs excluding 
80% of deaths, 11 genes have an equal ag- 
gregate score of 5: Ank3, Slc25a3, Vezt, Stab2, 
Gnptab, Tdg, Fgd6, Slc17a8, Abcb11, Slc25a12, 
and Pdk1 (Fig. 5B and data S6). We tested the 
orthologs of Ank3, Slc25a3, Fgd6, Slc17a8, 
Abcb11, Slc25a12, and Pdk1 in C. elegans (Fig. 
5C and fig. S6). Of those, tag-77 (Fgd6) and 
pdhk-2 (Pdk1) shortened and increased life 
span, respectively (p = 0.0019 and p < 0.0001). 
Whereas the tag-77 RNAi life-span reduction is 
modest compared with the other significant 
RNAi effects (~14% reduction in median and 
maximum life span), a locus near human FGD6 
(rs12830425) has been recently found to be 
associated with health span, life span, and 
exceptional longevity, rendering it a very inte- 
resting cross-species aging and longevity candi- 
date gene (49). Pyruvate dehydrogenase kinase 
1 (Pdk1) regulates cellular energetics, as its 
kinase activity inactivates pyruvate dehydrogenase 
(PDH) activity, thereby shutting down glucose 
oxidation in the mitochondrion. In agreement 
with our results, RNAi and mutations in pdhk- 
2 have been previously shown to extend worm 
life span (50). As PDH complex has been 
proposed to be a therapeutic target for age- 
related diseases, Pdk1 modulation may be ap- 
pealing as an anti-aging intervention because 
it is a known target of dichloroacetate (51). Some 
of the top-scoring late male longevity QTL genes 
encode mitochondrial proteins (Slc25a3, 
Slc25a12, Pdk1). Mitochondrial dysfunction 
has been extensively linked to growth and 
longevity regulation in model organisms (52-54). 
In addition to their metabolic role, mitochondria 
affect other features of aging such as epigenetics 
and proteostasis, hence acting as a central hub 
in the aging process (4, 5). As a case in point, 
mitochondrial health is associated with a 31% 
longer life span in rats selectively bred for higher 
intrinsic exercise capacity (55). It is worth 
noting, however, that the late male longevity 
locus on chromosome 10 also contains genes 
known to affect aging such as Igf1, Sirt1, and 
Sirt6, yet these genes have not been prioritized 
on the basis of criteria used in this 
study (data S6). 

Fig. 5. Annotation, prioritization, and validation of candidate longevity 
genes. (A) Diagram of the gene annotation and scoring pipeline. Scores are 
indicated in red. Genes under longevity QTL peaks were annotated based on 
whether they (i) have a high impact variant (HIV) in one of the UM-HET3 parents; 
(ii) have an HIV whose parental alleles are in line with the direction of the QTL’s 
effect; (iii) are differentially expressed in the UM-HET3 livers; (iv) show haplotype- 
specific expression in UM-HET3 livers; (v) are differentially expressed in any tissue in 
the Tabula Muris Senis (TMS); (vi) are differentially expressed in more than one TMS 
tissue; (vii) are or have orthologs that a [...] 
the mouse; (ix) are in GenAge in multiple species; (x) have a differentially expressed 
human ortholog in any GTEx tissue; (xi) are differentially expressed in GTEx in 
multiple tissues; (xii) have been identified in human GWAS for longevity or age- 
related disease; or (xiii) have a link to longevity through transcriptome-wide 
Mendelian randomization (TWMR) (41). Genes that have a worm ortholog and an 
available RNAi clone were further validated and marked based on whether there is 
shortened, extended, or unchanged life span. (B) Top-scoring genes under 
selected longevity QTL genes. (C) Significant effects of RNAi-mediated knockdown 
of worm orthologs of some of the highest-scoring genes. Eighty worms were used 
per group and the log-rank test was performed. Worms that escaped from the 
plates or had vulva explosions were censored from the life span measurements. 
Nominal p values are indicated and significance is determined after Bonferroni 
correction (n = 19 genes tested in total; see fig. S6 for all tested genes). 
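
The Fig. 5 legend states that RNAi significance is determined after Bonferroni correction across the 19 genes tested. A minimal sketch of that threshold follows, assuming a family-wise alpha of 0.05 (the alpha level is an assumption of this sketch). The tag-77 p value is the one reported in the text; the second entry is a hypothetical value added only for contrast.

```python
def bonferroni_significant(p_values, n_tests, alpha=0.05):
    """Flag nominal p-values that fall below alpha / n_tests."""
    threshold = alpha / n_tests
    return {name: p < threshold for name, p in p_values.items()}


# tag-77 p value as reported in the text; "hypothetical_gene" is made up.
nominal = {"tag-77": 0.0019, "hypothetical_gene": 0.01}
result = bonferroni_significant(nominal, n_tests=19)
print(f"threshold = {0.05 / 19:.5f}")
print(result)
```

Under these assumptions, a nominal p of 0.0019 clears the corrected threshold of roughly 0.0026, whereas a nominal p of 0.01 would not.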

As the results here serve for hypothesis generation 
rather than to establish causal relations, 
they provide a resource for the community. 
For that purpose, we developed an interactive 
web application that is helpful to explore the 
entirety of the results (https://www.systems-genetics.org/itp-longevity). 

Discussion 

Our analyses draw a general picture in which 
the effects of genetic factors are mostly sex, 
age, and trait specific. One key finding is that 
many of the QTLs seem to modulate life span 
only in mice beyond a certain age, especially 
in males. We find that genetic determinants 
of longevity and body weight variation are dif- 
ferent and sex specific even though the two 
traits are highly correlated. These two traits 
are more correlated in males, leading to some 
overlaps in the identified QTL peaks. Using 
mediation analysis, we could discern whether 
overlaps in male-specific longevity and body 
weight QTL are due to distinct genetic effects 
on each trait (chromosomes 4 and 10) or genetic 
effects on body weight affecting longevity 
(chromosome 9). 

Although our work did not document a 
large number of segregating loci with detect- 
ably large effects on life span in both male and 
female mice, it is possible that life span is in- 
stead modulated by a large number of loci with 
smaller individual effects. It is also possible that 
life span is modulated by interactions among 
segregating loci (gene-by-gene interactions). 
Regulation of insulin-like growth factor 1 (IGF1) 
levels in UM-HET3 mice, for example, has been 
shown to be influenced largely by three-gene 
and four-gene combinations (56), so that ex- 
treme levels of IGF1 require allele combinations 
present in only 1/8 or 1/16 of the population. 
Our methods do not consider gene-by-gene 
interactions of this kind and may have missed 
life span loci that have effects only in specific 
genetic contexts. 

The combination of high heterozygosity in 
any single mouse and variability 
across mice means that the UM-HET3 are 
more heterogeneous than most mice currently 
used in research, most of which are inbred. 
However, we note that all four inbred grand- 
parents of the UM-HET3 population have 
undergone hundreds of generations of selec- 
tion for breeding success in laboratory con- 
ditions and may well have lost genetic 
variants, present in natural mouse popula- 
tions, with major effects on maturation rate, 
growth rate, body size, IGF1 levels, and life 
span. There is evidence, for example, that off- 
spring of crosses between wild-trapped mice 
and laboratory-adapted stocks of mice vary in 
IGF1 levels, with high life span associated with 
lower IGF1 levels within these segregating F2 
populations (57). Restricting our genetic anal- 
ysis to alleles present in laboratory-adapted 
stocks would prevent us from detecting alleles 
present only in wild mouse populations. 

It is also plausible that segregating alleles 
that do influence life span in UM-HET3 mice 
are principally effective in one but not both 
sexes. Genetic analysis of Drosophila longevity 
(58) has shown that most life-span loci are 
indeed sex specific, and studies of drugs that 
modulate mouse life span have also shown sex 
specificity for more than half of the effective 
drugs (59-61). The apparent near-absence of 
loci with strong, sex-independent effects on 
life span does not preclude the possibility that 
shared processes—such as proteostasis, mito- 
chondrial protection, inflammation, stress- 
resistance pathways, and many others—could 
contribute to regulation of aging and longevity 
in both sexes. 

In addition to genetics, nongenetic factors 
such as very early growth and access to nu- 
trients play a role in determining longevity. 
Although the relationship between early growth, 
metabolism, and growth has been established 
through direct experimentation (14, 62), it is 
important to shed light on these effects in 
the ITP UM-HET3 mice. Understanding how 
genetics affects longevity will thus inevitably 
require accounting for the different effects 
of genetics, life history, and environment on 
processes that affect aging, disease risk, and 
mortality. Indeed, we show that litter size has 
a negative association with body weight and a 
marked indirect effect on longevity. 

Not all aspects of mouse physiology are expected 
to be identical in humans, as shown 
by a study reporting that, in contrast to 
humans, mouse fasting glucose levels increase 
with age and higher glucose is associated with 
longer life span (63). In the same study, how- 
ever, the longitudinal trajectories of body weight 
and fat were shown to be similar between mice 
and humans. Therefore, there may be sufficient 
similarities for genetic studies in mouse pop- 
ulations to inform studies on human aging. 
Here we show that relationships between early 
growth, body size, and longevity are also con- 
served in humans. Specifically, the MR and MR- 
mediation analysis of the UK Biobank data 
indicate that early growth is likely causal in 
determining adult longevity, through its effect 
on adult BMI. BMI is determined by adiposity 
and size in different ways than body weight 
in mice; hence exploring similarities between 
mice and humans will require measurements 
of body composition and size in mice, data 
which are not available in the ITP. 

In terms of gene expression, the liver 
profiles reveal a strong increase in immune- 
related transcripts with age, which is indica- 
tive of chronic inflammation or inflammaging 
(64). Another case in point is that females, 
who are longer-lived than males, have a higher 
expression of interferon-β (IFN-β)-related 
genes, consistent with previous observations 
(30). This is reminiscent of the situation in 
humans where the more vigorous IFN-β response 
in women has been invoked to explain 
the attenuated severity of infections, including 
COVID-19, likely explaining higher mortality 
in males (31). It is therefore possible that differences 
in interferon response gene activity 
may play a contributing role in the sex- and 
age-specific differences in disease susceptibility. 
We capitalized further on the fact that 
UM-HET3 mice are heterozygous to detect 
extensive haplotype-specific expression across 
two sexes and two ages. This information was 
used to help prioritize genes under QTL peaks 
and could also serve as a resource for more in- 
depth study of the effect of regulatory varia- 
tion on gene expression. 

The characterization of genes under QTLs 
provides a resource for further investigation 
of their putative roles in mediating longevity 
and body weight variation (see interactive application: 
https://www.systems-genetics.org/itp-longevity). 
Future studies that include more 
ITP mice as well as denser genetic maps may 
yield improved QTL results. Finally, more longi- 
tudinal data collection accompanied by exten- 
sive phenotyping would be required to shift the 
focus from longevity to a better understand- 
ing of genetics of healthy aging or health span. 
There is therefore great potential to reach valu- 
able insights into aging by feeding into the 
virtuous cycle between mouse genetics and 
human drug discovery, which has already been 
a successful paradigm in identifying disease 
genes for drug discovery (65). 


Materials and methods summary 
Phenotype and genotype data of UM-HET3 mice 
from the ITP 


The UM-HET3 mice are genetically heterogeneous 
progeny of CByB6F1 females and C3D2F1 
males, produced at each of three sites—the 
Jackson Laboratory (JL), the University of 
Michigan at Ann Arbor (UM), and the Univer- 
sity of Texas Health Science Center at San 
Antonio (UT) (6). Experiments at the JL, the 
UM, and the UT were reviewed and approved 
annually by the Institutional Animal Care and 
Use Committees at each site. Life span and weight 
data for ITP mice were compiled by the labo- 
ratories of D. Harrison, R. Miller, and R. Strong, 
and are available in the Mouse Phenome 
Database (66). Litter size data were provided by 
J. Nelson. Tail samples were sent to GeneSeek 
(Neogen Corporation), where genomic DNA was 
extracted, followed by single-nucleotide poly- 
morphism (SNP) genotyping by matrix-assisted 
laser desorption/ionization-time-of-flight mass 
spectrometry (67) at a maximum of 270 markers. 


QTL mapping 

Longevity 

To assess the effect of haplotype on longevity, 
we used the coxme package to fit multiple 
mixed-effects Cox models. The full model in- 
cludes the two sexes and has sex as a fixed 
effect and site as a random effect. The cohort 
year was included as a random effect that is 
nested in site. The sex-specific models do not 
have sex as a fixed effect. To define significance 
thresholds, we performed 1000 permutations 
with label swapping. We systematically performed 
QTL mapping for longevity by eliminating different 
percentiles (or quantiles) of the shortest- 
lived mice. 
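
The two ideas in this subsection, dropping a fraction of the shortest-lived animals and deriving a significance threshold from label-swapping permutations, can be sketched as follows. This is an illustration under loud assumptions: it handles a single marker with two genotype classes and uses an absolute difference in mean life span as a stand-in statistic, not the mixed-effects Cox models fitted with coxme in the study. All names and values are hypothetical.

```python
import random
from statistics import mean


def truncate_shortest_lived(lifespans, genotypes, fraction):
    """Drop the given fraction of shortest-lived animals.

    Returns the remaining (lifespan, genotype) pairs sorted by lifespan.
    """
    paired = sorted(zip(lifespans, genotypes), key=lambda t: t[0])
    n_drop = int(round(len(paired) * fraction))
    return paired[n_drop:]


def mean_difference(pairs):
    """Stand-in statistic: |mean lifespan of A - mean lifespan of B|."""
    a = [life for life, g in pairs if g == "A"]
    b = [life for life, g in pairs if g == "B"]
    return abs(mean(a) - mean(b))


def permutation_threshold(pairs, n_perm=1000, quantile=0.95, seed=0):
    """Null quantile of the statistic under shuffled genotype labels."""
    rng = random.Random(seed)
    lifespans = [life for life, _ in pairs]
    labels = [g for _, g in pairs]
    null_stats = []
    for _ in range(n_perm):
        rng.shuffle(labels)  # label swapping keeps group sizes fixed
        null_stats.append(mean_difference(list(zip(lifespans, labels))))
    null_stats.sort()
    return null_stats[int(quantile * (n_perm - 1))]


# Hypothetical life spans in days and genotype labels (assumption).
lifespans = [100, 200, 300, 400, 500, 600, 700, 800, 900, 1000]
genotypes = ["A", "B", "A", "B", "A", "B", "A", "B", "A", "B"]
kept = truncate_shortest_lived(lifespans, genotypes, fraction=0.2)
observed = mean_difference(kept)
threshold = permutation_threshold(kept, n_perm=200, seed=1)
print(observed, threshold)
```

Repeating the truncation over a grid of fractions and recomputing the statistic each time mirrors the idea of scanning for loci whose effect appears only beyond a certain age.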


Body weight 


We employed a similar strategy as for longevity 
but using the lme4 package to fit mixed 
linear models. 


Web application 


We developed an application using the R Shiny 
package to allow researchers to browse the raw 
data and results of this project: https://www.systems-genetics.org/itp-longevity. 

Full materials and methods are available in 
the supplementary materials (8). 


REFERENCES AND NOTES 


1. 


10. 


Bou Sleiman et al., Science 377, eabo3191 (2022) 


C. Lopez-Otin, M. A. Blasco, L. Partridge, M. Serrano, 

G. Kroemer, The hallmarks of aging. Cell 153, 1194-1217 
(2013). doi: 10.1016/j.cell.2013.05.039; pmid: 23746838 

C. J. Kenyon, The genetics of ageing. Nature 464, 504-512 
(2010). doi: 10.1038/nature08980; pmid: 20336132 

M. Hook et al., Genetic cartography of longevity in humans and 
mice: Current landscape and horizons. Biochim. Biophys. Acta 
Mol. Basis Dis. 1864 (9 Pt A), 2718-2732 (2018). doi: 10.1016/ 
j.bbadis.2018.01.026; pmid: 29410319 

L. N. Booth, A. Brunet, The aging epigenome. Mol. Cell 62, 
728-744 (2016). doi: 10.1016/j.molcel.2016.05.013; 

pmid: 27259204 

R. C. Taylor, A. Dillin, Aging as an event of proteostasis 
collapse. Cold Spring Harb. Perspect. Biol. 3, 004440 (2011). 
doi: 10.1101/cshperspect.a004440; pmid: 21441594 

R. A. Miller et al., An Aging Interventions Testing Program: 
Study design and interim report. Aging Cell 6, 565-575 (2007). 
doi: 10.1111/j.1474-9726.2007.00311.x; pmid: 17578509 

N. L. Nadon et a/., Design of aging intervention studies: The NIA 
interventions testing program. Age (Dordr.) 30, 187-199 
(2008). doi: 10.1007/s11357-008-9048-1; pmid: 19424842 
See supplementary materials. 
F. Macchiarini, R. A. Miller, R. Strong, N. Rosenthal, 

D. E. Harrison, “Chapter 10 - NIA Interventions Testing 
Program: A collaborative approach for investigating 
interventions to promote healthy aging” in Handbook of the 
Biology of Aging (Ninth Edition), N. Musi, P. J. Hornsby, Eds. 
(Academic Press, 2021; https://www.sciencedirect.com/ 
science/article/pii/B978012815962000010X), Handbooks of 
Aging, pp. 219-235. 

C. J. Cheng, J. A. L. Gelfond, R. Strong, J. F. Nelson, Genetically 
heterogeneous mice exhibit a female survival advantage that is 
age- and site-specific: Results from a large multi-site study. 
Aging Cell 18, e12905 (2019). doi: 10.1111/acel.12905; 

pmid: 30801953 

A. U. Jackson, A. T. Galecki, D. T. Burke, R. A. Miller, Mouse loci 
associated with life span exhibit sex-specific and epistatic 
effects. J. Gerontol. A Biol. Sci. Med. Sci. 57, B9-B15 (2002). 
doi: 10.1093/gerona/57.1.B9; pmid: 11773201 


. R. Gelman, A. Watson, R. Bronson, E. Yunis, Murine 


chromosomal regions correlated with longevity. Genetics 118, 
693-704 (1988). doi: 10.1093/genetics/118.4.693; 
pmid: 3163317 


. C. J. Bult et al., Mouse Genome Database (MGD) 2019. Nucleic 


Acids Res. 47, D801-D806 (2019). doi: 10.1093/nar/gky1056; 
pmid: 30407599 


. R. A. Miller, J. M. Harper, A. Galecki, D. T. Burke, Big mice die 


young: Early life body weight predicts longevity in genetically 
heterogeneous mice. Aging Cell 1, 22-29 (2002). doi: 10.1046/ 
j.1474-9728.2002.00006.x; pmid: 12882350 


. S. Roy et al., Gene-by-environment modulation of lifespan and 


weight gain in the murine BXD family. Nat. Metab. 3, 1217-1227 
(2021). doi: 10.1038/s42255-021-00449-w; pmid: 34552269 


. W. H. Dietz, Childhood weight affects adult morbidity and 


mortality. J. Nutr. 128 (Suppl), 411S-414S (1998). doi: 10.1093/ 
jn/128.2.41S; pmid: 9478038 

C. Maffeis, L. Tato, Long-term effects of childhood obesity on 
morbidity and mortality. Horm. Res. 55 (Suppl 1), 42-45 
(2001). pmid: 11408761 


18. M. E. Bowen, The relationship between body weight, frailty, and
the disablement process. J. Gerontol. B Psychol. Sci. Soc. Sci.
67, 618-626 (2012). doi: 10.1093/geronb/gbs067; pmid: 22967933
19. D. H. Sullivan, L. Liu, P. K. Roberson, M. M. Bopp, J. C. Rees,
Body weight change and mortality in a cohort of elderly
patients recently discharged from the hospital. J. Am. Geriatr.
Soc. 52, 1696-1701 (2004). doi: 10.1111/j.1532-5415.2004.52463.x;
pmid: 15450047
20. L. Sun, A. A. Sadighi Akha, R. A. Miller, J. M. Harper, Life-span
extension in mice by preweaning food restriction and by
methionine restriction in middle age. J. Gerontol. A Biol. Sci.
Med. Sci. 64, 711-722 (2009). doi: 10.1093/gerona/glp051;
pmid: 19414512
21. S. Burgess et al., Dissecting causal pathways using Mendelian
randomization with summarized genetic data: Application to
age at menarche and risk of breast cancer. Genetics 207,
481-487 (2017). doi: 10.1534/genetics.117.300191;
pmid: 28835472
22. P. R. Timmers et al., Genomics of 1 million parent lifespans
implicates novel pathways and common diseases and
distinguishes survival chances. eLife 8, e39856 (2019).
doi: 10.7554/eLife.39856; pmid: 30642433
23. L. Yengo et al., Meta-analysis of genome-wide association
studies for height and body mass index in ~700000 individuals
of European ancestry. Hum. Mol. Genet. 27, 3641-3649 (2018).
doi: 10.1093/hmg/ddy271; pmid: 30124842
24. J. Tyrrell et al., Height, body mass index, and socioeconomic
status: Mendelian randomisation study in UK Biobank. BMJ
352, i582 (2016). doi: 10.1136/bmj.i582; pmid: 26956984
25. F. W. Albert, L. Kruglyak, The role of regulatory variation in
complex traits and disease. Nat. Rev. Genet. 16, 197-212
(2015). doi: 10.1038/nrg3891; pmid: 25707927
26. E. Porcu et al., Differentially expressed genes reflect
disease-induced rather than disease-causing changes in the
transcriptome. Nat. Commun. 12, 5647 (2021).
doi: 10.1038/s41467-021-25805-y; pmid: 34561431
27. M. Bou Sleiman et al., The gene-regulatory footprint of aging
highlights conserved central regulators. Cell Rep. 32, 108203
(2020). doi: 10.1016/j.celrep.2020.108203; pmid: 32997995
28. B. A. Benayoun et al., Remodeling of epigenome and
transcriptome landscapes with aging in mice reveals
widespread induction of inflammatory responses. Genome Res.
29, 697-709 (2019). doi: 10.1101/gr.240093.118;
pmid: 30858345
29. A. Tyshkovskiy et al., Identification and application of gene
expression signatures associated with lifespan extension. Cell
Metab. 30, 573-593.e8 (2019). doi: 10.1016/j.cmet.2019.06.018;
pmid: 31353263
30. S. T. Gal-Oz et al., ImmGen report: Sexual dimorphism in the
immune system transcriptome. Nat. Commun. 10, 4295
(2019). doi: 10.1038/s41467-019-12348-6; pmid: 31541153
31. P. Bastard et al., Autoantibodies against type I IFNs in patients
with life-threatening COVID-19. Science 370, eabd4585 (2020).
doi: 10.1126/science.abd4585; pmid: 32972996
32. T. Takahashi et al., Sex differences in immune responses that
underlie COVID-19 disease outcomes. Nature 588, 315-320
(2020). doi: 10.1038/s41586-020-2700-3; pmid: 32846427
33. J. P. de Magalhães, J. Curado, G. M. Church, Meta-analysis of
age-related gene expression profiles identifies common
signatures of aging. Bioinformatics 25, 875-881 (2009).
doi: 10.1093/bioinformatics/btp073; pmid: 19189975
34. N. Almanzar et al., A single-cell transcriptomic atlas
characterizes ageing tissues in the mouse. Nature 583,
590-595 (2020). doi: 10.1038/s41586-020-2496-1;
pmid: 32669714
35. J. J. Crowley et al., Analyses of allele-specific gene expression
in highly divergent mouse crosses identifies pervasive allelic
imbalance. Nat. Genet. 47, 353-360 (2015).
doi: 10.1038/ng.3222; pmid: 25730764
36. S. F. Pinter et al., Allelic imbalance is a prevalent and
tissue-specific feature of the mouse transcriptome. Genetics 200,
537-549 (2015). doi: 10.1534/genetics.115.176263;
pmid: 25858912
37. R. Mott et al., The architecture of parent-of-origin effects in
mice. Cell 156, 332-342 (2014). doi: 10.1016/j.cell.2013.11.043;
pmid: 24439386
38. S. Cleary, C. Seoighe, Perspectives on allele-specific
expression. Annu. Rev. Biomed. Data Sci. 4, 101-122 (2021).
doi: 10.1146/annurev-biodatasci-021621-122219;
pmid: 34465174
39. R. Tacutu et al., Human ageing genomic resources: New and
updated databases. Nucleic Acids Res. 46 (D1), D1083-D1090
(2018). doi: 10.1093/nar/gkx1042; pmid: 29121237


30 September 2022 


40. A. Battle, C. D. Brown, B. E. Engelhardt, S. B. Montgomery;
GTEx Consortium; Laboratory, Data Analysis & Coordinating
Center (LDACC)—Analysis Working Group; Statistical Methods
groups—Analysis Working Group; Enhancing GTEx (eGTEx)
groups; NIH Common Fund; NIH/NCI; NIH/NHGRI; NIH/NIMH;
NIH/NIDA; Biospecimen Collection Source Site—NDRI;
Biospecimen Collection Source Site—RPCI; Biospecimen Core
Resource—VARI; Brain Bank Repository—University of Miami
Brain Endowment Bank; Leidos Biomedical—Project
Management; ELSI Study; Genome Browser Data Integration
& Visualization—EBI; Genome Browser Data Integration
& Visualization—UCSC Genomics Institute, University of
California Santa Cruz; Lead analysts; Laboratory, Data Analysis
& Coordinating Center (LDACC); NIH program management;
Biospecimen collection; Pathology; eQTL manuscript working
group, Genetic effects on gene expression across human
tissues. Nature 550, 204-213 (2017).
doi: 10.1038/nature24277; pmid: 29022597
41. E. Porcu et al., Mendelian randomization integrating GWAS and
eQTL data reveals genetic determinants of complex and clinical
traits. Nat. Commun. 10, 3300 (2019).
doi: 10.1038/s41467-019-10936-0; pmid: 31341166
42. A. V. Samuelson, C. E. Carr, G. Ruvkun, Gene activities that
mediate increased life span of C. elegans insulin-like signaling
mutants. Genes Dev. 21, 2976-2994 (2007).
doi: 10.1101/gad.1588907; pmid: 18006689
43. R. Das et al., The homeodomain-interacting protein kinase
HPK-1 preserves protein homeostasis and longevity through
master regulatory control of the HSF-1 chaperone network and
TORC1-restricted autophagy in Caenorhabditis elegans. PLOS
Genet. 13, e1007038 (2017).
doi: 10.1371/journal.pgen.1007038; pmid: 29036198
44. A. J. Dobson et al., Longevity is determined by ETS
transcription factors in multiple tissues and diverse species.
PLOS Genet. 15, e1008212 (2019).
doi: 10.1371/journal.pgen.1008212; pmid: 31356597
45. S. P. Curran, G. Ruvkun, Lifespan regulation by evolutionarily
conserved genes essential for viability. PLOS Genet. 3, e56
(2007). doi: 10.1371/journal.pgen.0030056; pmid: 17411345
46. C. Y. Ewald, J. N. Landis, J. Porter Abate, C. T. Murphy,
T. K. Blackwell, Dauer-independent insulin/IGF-1-signalling
implicates collagen remodelling in longevity. Nature 519,
97-101 (2015). doi: 10.1038/nature14021; pmid: 25517099
47. T. Yamagata et al., Genome organization of human 48-kDa
oligosaccharyltransferase (DDOST). Genomics 45, 535-540
(1997). doi: 10.1006/geno.1997.4966; pmid: 9367678
48. R. D. Semba, E. J. Nicklett, L. Ferrucci, Does accumulation of
advanced glycation end products contribute to the aging
phenotype? J. Gerontol. A Biol. Sci. Med. Sci. 65, 963-975
(2010). doi: 10.1093/gerona/glq074; pmid: 20478906
49. P. R. H. J. Timmers, J. F. Wilson, P. K. Joshi, J. Deelen,
Multivariate genomic scan implicates novel loci and haem
metabolism in human ageing. Nat. Commun. 11, 3570 (2020).
doi: 10.1038/s41467-020-17312-3; pmid: 32678081
50. L. Mouchiroud et al., Pyruvate imbalance mediates metabolic
reprogramming and mimics lifespan extension by dietary
restriction in Caenorhabditis elegans. Aging Cell 10, 39-54
(2011). doi: 10.1111/j.1474-9726.2010.00640.x;
pmid: 21040400
51. P. W. Stacpoole, The pyruvate dehydrogenase complex as a
therapeutic target for age-related diseases. Aging Cell 11,
371-377 (2012). doi: 10.1111/j.1474-9726.2012.00805.x;
pmid: 22321732
52. A. Dillin et al., Rates of behavior and aging specified by
mitochondrial function during development. Science 298,
2398-2401 (2002). doi: 10.1126/science.1077780;
pmid: 12471266
53. R. H. Houtkooper et al., Mitonuclear protein imbalance as a
conserved longevity mechanism. Nature 497, 451-457 (2013).
doi: 10.1038/nature12188; pmid: 23698443
54. T. Lima, T. Y. Li, A. Mottis, J. Auwerx, Pleiotropic effects of
mitochondria in aging. Nat. Aging 2, 199-213 (2022).
doi: 10.1038/s43587-022-00191-2
55. M. A. Aon et al., Mitochondrial health is enhanced in rats with
higher vs. lower intrinsic exercise capacity and extended
lifespan. NPJ Aging Mech. Dis. 7, 1-16 (2021).
doi: 10.1038/s41514-020-00054-3; pmid: 33398019
56. P. Hanlon et al., Three-locus and four-locus QTL interactions
influence mouse insulin-like growth factor-I. Physiol. Genomics
26, 46-54 (2006). doi: 10.1152/physiolgenomics.00247.2005;
pmid: 16782841
57. J. M. Harper, S. J. Durkee, R. C. Dysko, S. N. Austad,
R. A. Miller, Genetic modulation of hormone levels and life span
in hybrids between laboratory and wild-derived mice.
J. Gerontol. A Biol. Sci. Med. Sci. 61, 1019-1029 (2006).
doi: 10.1093/gerona/61.10.1019; pmid: 17077194
58. S. V. Nuzhdin, E. G. Pasyukova, C. L. Dilda, Z.-B. Zeng,
T. F. C. Mackay, Sex-specific quantitative trait loci affecting
longevity in Drosophila melanogaster. Proc. Natl. Acad. Sci.
U.S.A. 94, 9734-9739 (1997). doi: 10.1073/pnas.94.18.9734;
pmid: 9275193
59. M. Sadagurski, G. Cady, R. A. Miller, Anti-aging drugs reduce
hypothalamic inflammation in a sex-specific manner. Aging Cell 16,
652-660 (2017). doi: 10.1111/acel.12590; pmid: 28544365
60. M. Garratt, B. Bower, G. G. Garcia, R. A. Miller, Sex differences
in lifespan extension with acarbose and 17-α estradiol: Gonadal
hormones underlie male-specific improvements in glucose
tolerance and mTORC2 signaling. Aging Cell 16, 1256-1266
(2017). doi: 10.1111/acel.12656; pmid: 28834262
61. R. A. Miller et al., Rapamycin-mediated lifespan increase in
mice is dose and sex dependent and metabolically distinct
from dietary restriction. Aging Cell 13, 468-477 (2014).
doi: 10.1111/acel.12194; pmid: 24341993
62. M. Parra-Vargas, M. Ramon-Krauel, C. Lerin,
J. C. Jimenez-Chillaron, Size does matter: Litter size strongly
determines adult metabolism in rodents. Cell Metab. 32,
334-340 (2020). doi: 10.1016/j.cmet.2020.07.014;
pmid: 32814016
63. D. L. Palliyaguru et al., Fasting blood glucose as a predictor of
mortality: Lost in translation. Cell Metab. 33, 2189-2200.e3
(2021). doi: 10.1016/j.cmet.2021.08.013; pmid: 34508697
64. C. Franceschi, J. Campisi, Chronic inflammation
(inflammaging) and its potential contribution to age-associated
diseases. J. Gerontol. A Biol. Sci. Med. Sci. 69 (Suppl 1), S4-S9
(2014). doi: 10.1093/gerona/glu057; pmid: 24833586


Bou Sleiman et al., Science 377, eabo3191 (2022) 


65. J. H. Nadeau, J. Auwerx, The virtuous cycle of human genetics 
and mouse models in drug discovery. Nat. Rev. Drug Discov. 18, 
255-272 (2019). doi: 10.1038/s41573-018-0009-9; 
pmid: 30679805 

66. M. A. Bogue et al., Mouse Phenome Database: A data 
repository and analysis suite for curated primary mouse 
phenotype data. Nucleic Acids Res. 48, D716-D723 (2020). 
pmid: 31696236 

67. S. Gabriel, L. Ziaugra, D. Tabbaa, SNP genotyping using the 
Sequenom MassARRAY iPLEX platform. Curr. Protoc. Hum. 
Genet. 60, 2.12.1-2.12.18 (2009). doi: 10.1002/0471142905. 
hg0212s60; pmid: 19170031 

68. M. Bou Sleiman, Sex- and age-dependent genetics of longevity 
in a heterogeneous mouse population, Zenodo (2022); 
https://doi.org/10.5281/zenodo.6625242. 


ACKNOWLEDGMENTS 


We thank all technical and scientific members of the 
Interventions Testing Program, who have produced and made 
the data available. We also thank J.-D. Morel, G. Benegiamo, and 
L. Goeminne for comments on the manuscript. Funding: Ecole 
Polytechnique Fédérale de Lausanne (J.A.); European Research 
Council grant ERC-AdG-787702 (J.A.); Swiss National Science 
Foundation grant 31003A-179435 (J.A.); Swiss National Science 
Foundation grant 310030-189147 (Z.K.); National Institutes of 
Health grant AG043930 (R.W., J.A.); National Institutes of
Health grant AG022303 (R.M.); National Institutes of Health
grant AG022308 (D.H.); National Institutes of Health grant
AG022307 (R.S.); Glenn Foundation for Medical Research
(R.M.); R.S. is supported by a Senior Research Career Scientist
Award from the Department of Veterans Affairs Office of
Research and Development. Author contributions:
Conceptualization: J.A., R.W., R.M. Data curation: S.R., M.B.S. 
Formal Analysis: M.B.S., M.S., G.V.G.vA. Investigation: M.B.S., 
A.G., M.S. Methodology: M.B.S., S.R., R.W. Resources: R.M., R.S., 
J.N., D.H., R.W., J.A. Software: M.B.S. Validation: M.B.S., A.G. 
Visualization: M.B.S. Web application: M.B.S. Writing - original 
draft: M.B.S., J.A. Writing - review and editing: M.B.S., J.A., 
R.W., R.M., Z.K. Funding acquisition: J.A., R.W., R.M., J.N., D.H., 
R.S. Project administration: R.W., J.A. Supervision: J.A., R.W. 
Competing interests: The authors declare that they have no 
competing interests. Data and materials availability: All data 
and code are available at Zenodo (68). Raw RNA-seq data are 
available on GEO (GSE194321). License information: Copyright 
© 2022 the authors, some rights reserved; exclusive licensee 
American Association for the Advancement of Science. No claim 
to original US government works.
https://www.science.org/about/science-licenses-journal-article-reuse


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.abo3191 
Materials and Methods 

Figs. S1 to S6

Tables S1 and S2 

References (69-93) 

Data S1 to S6 


Submitted 27 January 2022; resubmitted 9 June 2022 
Accepted 10 August 2022 
10.1126/science.abo3191 




RESEARCH 


RESEARCH ARTICLE SUMMARY 


DRUG DESIGN 


Structure-based discovery of nonopioid analgesics
acting through the α2A-adrenergic receptor


Elissa A. Fink†, Jun Xu†, Harald Hübner†, Joao M. Braz†, Philipp Seemann†, Charlotte Avet, Veronica Craik,
Dorothee Weikert, Maximilian F. Schmidt, Chase M. Webb, Nataliya A. Tolmachova, Yurii S. Moroz,
Xi-Ping Huang, Chakrapani Kalyanaraman, Stefan Gahbauer, Geng Chen, Zheng Liu, Matthew P. Jacobson,
John J. Irwin, Michel Bouvier*, Yang Du*, Brian K. Shoichet*, Allan I. Basbaum*, Peter Gmeiner*


INTRODUCTION: Epidemics of pain and opioid abuse underscore the need
for new nonopioid therapeutics to treat pain. Many nonopioid receptors
are involved in pain processing (nociception), but only a few have been
validated therapeutically. Of particular interest is the α2A-adrenergic
receptor (α2AAR), a G protein-coupled receptor (GPCR) whose activation
in the central nervous system has pain-relieving effects. The known
therapeutics targeting the α2AAR, like clonidine and dexmedetomidine,
are analgesic. They are also strongly sedating, which is important for
the primary indication of dexmedetomidine. This, however, has
restricted the use of these drugs to hospital settings and kept them
from being used in broader patient populations.

RATIONALE: Because GPCRs, like α2AAR, can signal into the cell through
multiple downstream effectors, we reasoned that agonists that were
chemically dissimilar to the highly related dexmedetomidine, clonidine,
and brimonidine might have different signaling and might be able to
separate sedation from analgesia. We sought these chemotypes among a
virtual library of more than 301 million diverse, readily accessible
molecules in the ZINC15 library (http://zinc15.docking.org), few of
which have been previously synthesized. We computationally docked each
virtual molecule into the highly similar α2BAR binding site,
prioritizing those that physically fit and that were chemically
unrelated to the known drugs.

Newly identified α2AAR agonists are analgesic without sedation. More than 301 million molecules were
docked against the activated α2BAR. Experimental testing identified α2AAR agonists with diverse chemical
scaffolds. The experimental structure of the ‘9087-α2AAR complex superposed closely to the computational
prediction. The newly discovered agonists had efficacy in an in vivo neuropathic pain model (top right) without
sedation, unlike dexmedetomidine (DEX) (bottom right). Gi-activation EC50 (nanomolar) and Emax (percentage)
values are shown. Single-letter abbreviations for the amino acid residues are as follows: D, Asp; F, Phe; I, Ile; S,
Ser; V, Val; and Y, Tyr. ns, not significant; *P < 0.05; ****P < 0.0001; SNI, spared nerve injury.

RESULTS: From the high-ranking docked compounds, we selected 48 for
de novo synthesis and testing. Against the α2BAR used in the virtual
docking screens, 30 molecules bound for a 63% hit rate, among the
highest to date for docking campaigns. Seventeen further bound to
α2AAR with binding constants in the low-nanomolar to low-micromolar
concentration range. Several acted as full or partial agonists of
α2AAR, activating the receptor. Among these was ‘9087 [mean effective
concentration (EC50) of 52 nM]. Notably, the docking-derived agonists
preferentially activated Gi, Go, and Gz G protein subtypes, which
contrasts with known drugs, like dexmedetomidine and brimonidine, that
activate a much broader set of G proteins and recruit β-arrestins.
Thus, the new agonists activate a more selective set of cellular
pathways than the known α2AAR drugs, something we had hoped for when
prioritizing new chemotypes.

The structures of two of these agonists were experimentally determined
in complex with the activated state of α2AAR. These experimental
ligand geometries closely corresponded to computational predictions.
They also templated the optimization of the initial docking hits and
led to more potent analogs, including PS75 (EC50 4.8 nM). The physical
features of these agonists allowed them to reach high brain
concentrations after systemic dosing. In animal behavioral assays, six
of these previously uncharacterized agonists relieved pain behaviors
in neuropathic, inflammatory, and acute thermal nociception assays.
Gene mutation and reversal of receptor binding with an α2AR antagonist
confirmed that analgesia occurred primarily through α2AAR. Crucially,
when compared with dexmedetomidine, none of the new compounds caused
sedation, even at substantially higher doses than required for pain
relief.


CONCLUSION: The separation of analgesic properties from sedation of
the new agonists is important for further α2AAR drug development. The
newly identified agonists, especially ‘9087 and PS75, overcome the
sedation liability of the previously known drugs, and several are
orally bioavailable. This makes them lead molecules for the
development of nonopioid pain therapeutics.

Because nonopioid analgesics are much sought after, we computationally docked more than 301 million
virtual molecules against a validated pain target, the α2A-adrenergic receptor (α2AAR), seeking new
α2AAR agonist chemotypes that lack the sedation conferred by known α2AAR drugs, such as
dexmedetomidine. We identified 17 ligands with potencies as low as 12 nanomolar, many with partial
agonism and preferential Gi and Go signaling. Experimental structures of α2AAR complexed with two of
these agonists confirmed the docking predictions and templated further optimization. Several
compounds, including the initial docking hit ‘9087 [mean effective concentration (EC50) of 52
nanomolar] and two analogs, ‘7075 and PS75 (EC50 4.1 and 4.8 nanomolar), exerted on-target analgesic
activity in multiple in vivo pain models without sedation. These newly discovered agonists are interesting
as therapeutic leads that lack the liabilities of opioids and the sedation of dexmedetomidine.


Epidemics in pain (1) and in opioid-use disorder (2, 3) have inspired
a search for nonopioid analgesics (1, 4). The α2A-adrenergic receptor
(α2AAR) is a nonopioid receptor targeted by dexmedetomidine, a
sedative that also has strong analgesic activity (5). Although
dexmedetomidine has many advantages in emergency room and intensive
care settings, its strong sedative effects (6, 7) and its lack of an
oral formulation (8) have limited its broad use as an analgesic. These
properties are barriers for future therapeutics targeting this
receptor.

Most α2AAR analgesics are chemically related, and the relationship of
their sedative to their analgesic properties is unclear. To find
therapeutics with new pharmacology, we sought new α2AAR chemotypes,
topologically unrelated to known α2AAR agonists. The α2B-adrenergic
receptor (α2BAR) active-state structure (9) became available, and its
binding site is highly conserved compared with that of α2AAR (fig.
S1); therefore, it should be possible to identify new α2AAR agonists
by structure-based docking. Meanwhile, the advent of readily
accessible make-on-demand (“tangible”) molecules (10-12) ranging from
hundreds of millions (10, 13, 14) to more than a billion molecules
(15, 16) has vastly increased the chemotypes available for ligand
discovery. Docking these libraries has revealed new chemotypes with 20
to 60% hit rates (13, 14, 17-20) and sometimes nanomolar potencies for
a growing range of targets (10, 13, 14, 18, 21-24), often with new
pharmacology (10, 13, 17, 25). Therefore, we targeted the α2BAR with
an ultralarge library docking screen.


Docking 301 million molecules versus the α2BAR


The ZINC15 and ZINC20 virtual libraries are composed of millions to
billions of tangible molecules, depending on the molecular property
range targeted, and are accessed by combining hundreds of thousands of
diverse building blocks through hundreds of well-characterized
reactions (10-12). Most of the molecules have not previously been
synthesized and range in mass, calculated LogP (cLogP) values (a
measure of hydrophobicity), and formal charge. Given the small size of
the α2BAR orthosteric site, we docked both the 20 million
fragment-like [compounds with smaller masses of <250 atomic mass units
(amu)] and 281 million lead-like (compounds with larger masses of 250
to 350 amu) molecules from the ZINC15 library (both sets having
cLogP < 3.5) (1) (Fig. 1A). More than 233 trillion complexes, an
average of 452,000 per molecule, were sampled by DOCK3.7 and scored
with its physics-based energy function (26) across three separate
screens (two fragment screens with different variables and one
lead-like screen; see Materials and methods). For each screen, the top
300,000 docking-ranked compounds were clustered for topological
similarity and then filtered to identify scaffolds dissimilar to known
agonists using an extended connectivity fingerprint (ECFP4). These
known agonists were drawn from the International Union of Basic and
Clinical Pharmacology (IUPHAR)-British Pharmacological Society (BPS)
database (27) and from the literature (28-31). Ligands with internal
torsional strain were deprioritized (32). An additional novelty filter
removed molecules similar to annotated α2AAR compounds in CHEMBL29
(28). Of the remaining top-ranked cluster representatives, 5000 from
each fragment screen and 20,000 for the lead-like screen were manually
evaluated in UCSF Chimera (https://rbvi.ucsf.edu/chimera) for key
polar and nonpolar interactions with α2BAR (9), including with
D92^3.32, F412^7.39, F387^6.51, Y391^6.55, and F388^6.52 [residues
conserved in α2AAR: D128^3.32, F427^7.39, F405^6.51, Y409^6.55, and
F406^6.52; superscripts use Ballesteros-Weinstein and the G
protein-coupled receptor database (GPCRdb) nomenclature (33)]. Most
α2AAR agonists, and certainly the clinically used dexmedetomidine and
clonidine, are fragments (27), and the docking results reflected this.
The docked fragment molecules fit in the orthosteric site, making key
contacts with the receptor, whereas molecules in the lead-like screen
generally did not fit in the small cavity (Fig. 1A). Accordingly, most
selected ligands came from the fragment docking screens.
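
The similarity-based novelty filter described above can be illustrated with a Tanimoto comparison of fingerprints. This is only a minimal sketch, not the authors' pipeline: it assumes each fingerprint is already available as a set of on-bit indices, and the function names, the toy bit sets, and the 0.35 cutoff are hypothetical values chosen for illustration.

```python
def tanimoto(a: set, b: set) -> float:
    """Tanimoto coefficient of two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def is_novel(candidate: set, known_fps: list, cutoff: float = 0.35) -> bool:
    """True if the candidate is below the cutoff against every known agonist.

    The cutoff is a hypothetical illustration value, not one from the study.
    """
    return all(tanimoto(candidate, known) < cutoff for known in known_fps)


# Toy fingerprints (hypothetical bit indices, not real ECFP4 output).
known_agonists = [{1, 4, 9, 16, 25}, {2, 4, 8, 16, 32}]
similar_hit = {1, 4, 9, 16, 36}    # shares 4 of 6 distinct bits with the first agonist
distinct_hit = {3, 5, 7, 11, 13}   # shares no bits with either agonist

for name, fp in [("similar_hit", similar_hit), ("distinct_hit", distinct_hit)]:
    best = max(tanimoto(fp, known) for known in known_agonists)
    print(f"{name}: max Tanimoto = {best:.2f}, novel = {is_novel(fp, known_agonists)}")
```

In practice the bit sets would come from a cheminformatics toolkit; the comparison logic stays the same.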

From the 64 high-ranking docked compounds prioritized by visual
inspection and purchased for in vitro testing, 48 were successfully
synthesized—44 fragments and 4 lead-like molecules. Compounds were
first tested for binding to the human α2BAR receptor, the structure
used in docking screens. Thirty molecules of the 48 tested had binding
constant (Ki) values less than 10 µM (table S1). This 63% hit rate is
among the highest for a docking campaign to date (10, 14, 21, 23, 34).
In radioligand competition assays, compound ZINC1173879087 (from here
on referred to as ‘9087) had a Ki of 1.7 nM; the remaining 29 had Ki
values ranging from 60 nM to 9.4 µM, which is relatively potent for
initial docking hits. Ten compounds (21%) had Ki values

Fig. 1. Newly discovered α2AAR agonists from ultralarge library docking.
(A) 301 million molecules were docked against the active state of α2BAR (PDB
6K41). Lead-like molecules (pink carbons) often spilled out of the orthosteric
site, whereas fragment molecules (green carbons) are well complemented
by that site. Hit rates were determined with a Ki cutoff of 10 µM. EvdW, van der
Waals; Ees, electrostatic; Elds, ligand desolvation. (B) The α2AR pharmacophore
model (9) overlaid on known α2AAR agonists dexmedetomidine, clonidine,
and norepinephrine and new agonists from docking (colors represent the
different moieties fulfilling the same role). (C) Gi activation and β-arrestin-2
recruitment for norepinephrine (NorEpi), dexmedetomidine (dex), clonidine
(clon), and several of the newly discovered docking agonists. (D) Docked
poses of these new agonists with hydrogen bonds to key recognition residues
of α2BAR shown as black dashed lines. For (C), data are means ± SEMs of
normalized results (n = 4 to 17 measurements for Gi and n = 3 to 8
measurements for β-arrestin-2). Single-letter abbreviations for the amino
acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly;
H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser;
T, Thr; V, Val; W, Trp; and Y, Tyr.

Fig. 2. Docking-predicted poses of ‘9087 and ‘4622 superpose well on the
cryo-EM structures of the ‘9087-α2AAR-GoA and ‘4622-α2AAR-GoA
complexes. (A and C) Cryo-EM structure of the ‘9087-α2AAR-GoA (A) and
‘4622-α2AAR-GoA (C) complexes. (B) Experimental ‘9087 structure (pink
carbons) superposed on the docked pose (orange carbons) (PDB 7W6P; RMSD


below 1 µM (table S1). The compounds were then tested for binding to
the murine α2AAR, again by radioligand competition. Of these, 17 (35%)
had a Ki better than 10 µM, with affinities ranging from 72 nM to
9.4 µM; five compounds (10%) had Ki values below 1 µM (table S1).
Against human α2AAR, the highest affinity was 12 nM (table S2).
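
The percentages quoted for the binding screens follow from the 48 compounds tested. A small sketch of the arithmetic is given below; the helper name is hypothetical, and half-up rounding is an assumption made so that 62.5% is reported as 63% (Python's built-in `round` would give 62).

```python
def percent_half_up(hits: int, tested: int) -> int:
    """Percentage of hits, rounded half-up to the nearest whole percent."""
    return int(100 * hits / tested + 0.5)


TESTED = 48  # compounds synthesized and assayed

# Hit counts as stated in the text.
counts = {
    "alpha2B, Ki < 10 uM": 30,
    "alpha2B, Ki < 1 uM": 10,
    "murine alpha2A, Ki < 1 uM": 5,
}

for label, hits in counts.items():
    print(f"{label}: {hits}/{TESTED} = {percent_half_up(hits, TESTED)}%")
```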


Discovery of new α2AAR partial Gi and Go (Gi/o) agonists


In functional assays, most of the potent binders were partial or full
agonists for α2AAR and α2BAR (Fig. 1, B to D; tables S1 and S2; and
figs. S2 to S4); few antagonists were found among the more potent
docking hits. This reflects the targeting of the activated state of
the receptor (35, 36) and was a goal of the screen. The best four
agonists from the docking screen include ‘9087 as well as
ZINC1240664622, ZINC1242282998, and ZINC001242890172 (from here on
referred to as ‘4622, ‘2998, and ‘0172, respectively), with the
α2AAR-mediated Gi activation maximum effect (Emax) ranging from 60 to
95% of norepinephrine response and mean effective concentration (EC50)
values of 9.7 to 210 nM in Gαi bioluminescence resonance energy
transfer (BRET) assays (Fig. 1C and table S2). We tested the effect of
receptor expression in cells on the functional properties of the
partial agonist, ‘9087, and ultimately of two optimized analogs, ‘7075
and PS75; all three remained potent Gi partial agonists, with Emax
decreasing with receptor expression (fig. S5). In an orthogonal cyclic
adenosine monophosphate (cAMP) assay, ‘9087 was a partial agonist with
an EC50 of 87 nM and Emax of 42%, which is broadly consistent with the
BRET assay (from here on, Gi activities are the Gαi BRET assay values,
unless otherwise noted) (Fig. 1C, fig. S4, and table S2).
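
To make the EC50 and Emax values concrete, the sketch below evaluates a simple Hill-type concentration-response curve using the cAMP-assay numbers quoted above for ‘9087 (EC50 87 nM, Emax 42%). The function name and the Hill slope of 1 are assumptions for illustration; the text does not state which curve model or slope was fitted.

```python
def hill_response(conc_nM: float, ec50_nM: float, emax_pct: float, hill: float = 1.0) -> float:
    """Response (% of reference agonist) under a Hill model.

    hill=1.0 is an assumed slope, not a value reported in the text.
    """
    if conc_nM <= 0:
        return 0.0
    return emax_pct / (1.0 + (ec50_nM / conc_nM) ** hill)


EC50_NM = 87.0   # '9087, cAMP assay
EMAX_PCT = 42.0  # '9087, cAMP assay

for conc in (8.7, 87.0, 870.0, 8700.0):
    resp = hill_response(conc, EC50_NM, EMAX_PCT)
    print(f"{conc:>7.1f} nM -> {resp:5.1f}% of reference response")
```

By construction the response at the EC50 is half of Emax, and a partial agonist plateaus at its Emax rather than at 100% no matter how high the concentration.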

The docking agonists had strong differential activity for Gi
activation compared with recruitment of β-arrestin-2. Although this
was also true of the known agonists, dexmedetomidine and clonidine,
for the new agonists the difference was accentuated so that arrestin
recruitment was almost completely eliminated at relevant
concentrations. Of the four best docking agonists, only ‘0172 had a
measurable efficacy for β-arrestin-2 recruitment, but only with 22% of
the Emax of norepinephrine and with weak potency (EC50 1.7 µM); for
the other three, β-arrestin-2 recruitment was negligible

1.14 Å). Hydrogen bonds and ion pairs are shown with dashed black lines to
F427^7.39 and D128^3.32, respectively. (D) Experimental ‘4622 structure (green
carbons) superposed on the docked pose (orange carbons) (PDB 7W7E; RMSD
1.14 Å). Hydrogen bond shown with dashed black lines to D128^3.32. For (B) and
(D), side chains of α2AAR residues within 4 Å of ligands are shown as sticks.
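
The RMSD values quoted in this caption compare docked and experimental ligand poses atom by atom. A minimal sketch of an all-atom RMSD between two poses in the same receptor frame follows; the function name and the toy coordinates are hypothetical, and no superposition step is applied, which is an assumption about how such a comparison is set up.

```python
import math


def all_atom_rmsd(pose_a, pose_b):
    """Root mean square deviation (same length units as the input coordinates).

    Both poses are lists of (x, y, z) tuples with atoms in the same order.
    No alignment is performed: the poses are assumed to share one frame.
    """
    if len(pose_a) != len(pose_b) or not pose_a:
        raise ValueError("poses must be non-empty and have the same number of atoms")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(pose_a, pose_b)
    )
    return math.sqrt(squared / len(pose_a))


# Toy example (hypothetical coordinates in angstroms): every atom shifted by 1 along x.
docked = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
experimental = [(x + 1.0, y, z) for (x, y, z) in docked]
print(f"RMSD = {all_atom_rmsd(docked, experimental):.2f}")
```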


(Fig. 1C and table S2). We note that this lack of arrestin recruitment
could reflect the partial agonism of the new agonists combined with
the weaker coupling of the arrestin pathway versus the well-coupled Gi
pathway, as indicated by the differences in potency and efficacy of
the reference agonists, dexmedetomidine and clonidine.
Agonists of α2AAR, including its endogenous ligand norepinephrine,
also activate other G protein pathways (37). Accordingly, we used the
enhanced bystander BRET (ebBRET)-based effector membrane translocation
assay (EMTA) (38) to test ‘9087 and its analogs, ‘7075 and PS75,
against a more expansive panel of G protein and β-arrestin subtypes.
The docking compounds preferentially activated Gi, Go, and Gz
(Gi/o/z) signaling, whereas known agonists norepinephrine,
dexmedetomidine, and brimonidine strongly activated multiple
additional G proteins and β-arrestins (figs. S6 and S7 and tables S3
and S4). Receptor internalization after treatment with compound was
also investigated by monitoring disappearance of α2AARs from the
plasma membrane (α2AAR-RlucII/rGFP-CAAX biosensor) and relocalization
of the receptors in endosomes (α2AAR-RlucII/rGFP-FYVE biosensor) (39).
Known agonists brimonidine and

Fig. 3. Structure-based optimization of ‘9087. (A) Strategies for analoging
‘9087 (left). Analogs of the pyridine, exocyclic nitrogen, and lipophilic nature
of the bicyclic ring revealed their importance for ‘9087 activity (middle).
Sampling alternate lipophilic bicyclic rings and modifying their substituents
identified eight more potent agonists (right). EC50 values are shown for Gi
activation. (B) Gi and β-arrestin-2 recruitment for ‘9087 and its two most
potent analogs, ‘7075 and PS75. (C) Modeled poses of ‘7075 (pink carbons)


norepinephrine show comparable responses for both biosensors, whereas
dexmedetomidine has about half of this response. Consistent with their
absence of β-arrestin recruitment, we found no effect of ‘9087, ‘7075,
and PS75 on disappearance from the plasma membrane and marginal effect
at the highest concentrations on endosomal relocalization (fig. S8).
Although such functional selectivity was not explicitly modeled in the
docking, it likely results from the new chemistry, which was
explicitly required (13, 14, 17).

Comparing the new agonists with dexmedetomidine, clonidine,
norepinephrine, and a previously described pharmacophore model for
α2AR selective agonists (9), both similar and distinct features emerge
(Fig. 1B). The pharmacophore model for known agonists and the new
docking compounds both have basic,
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agonists are dominated by imidazoles (un- 
saturated or partially saturated), whereas the 
docking compounds have diverse nonimid- 
azole rings. Both sets of compounds contain 
additional moieties off of a second aryl ring, 
typically two substituents for the known 
agonists; however, for the docking-derived 
compounds, these vary from bulky hydrophobic 
rings, to hydrophilic rings, to single substituents, 
to having no substituents off of the aryl ring at 
all. Not all of the docking compounds have an 
exocyclic linker as described in the pharmacophore
model. The protonated imidazole of
known agonists ion pairs with D92^3.32 and
hydrogen bonds to the backbone of F412^7.39
of α2BAR (9, 40) (fig. S1). Although several of
the docking-derived compounds also interacted
with both D92^3.32 and F412^7.39, they did so with
different heterocyclic rings (Fig. 1D).

and PS75 (blue carbons) based on the ‘9087-α2AAR cryo-EM structure with
substituents oriented toward open space in the orthosteric site. Hydrogen
bonds and ionic interactions are shown with dashed black lines to F427^7.39
and D128^3.32, respectively. For (A), Gi and β-arrestin-2 recruitment data
for analogs are shown in figs. S14 and S15 and table S8. For (B), data are
means ± SEMs of normalized results (n = 7 to 17 measurements for Gi
and n = 4 to 8 measurements for β-arrestin-2).


To test the docking model and to template
structure-based optimization, we determined
the structures of the ‘9087-α2AAR-GoA and
‘4622-α2AAR-GoA complexes at nominal
resolutions of 3.47 and 3.38 Å, respectively,
using single-particle cryo-electron microscopy
(cryo-EM) (Fig. 2, A to D, and figs. S9 and S10).
The predicted docked pose superimposes on the
cryo-EM result of ‘9087 with a 1.14-Å all-atom
root mean square deviation (RMSD) of the
agonist; the docking-predicted interactions
are recapitulated in the experimental structure
(Fig. 2B). The interactions between ‘9087 and
α2AAR differ from those of norepinephrine but
resemble those of imidazoline-containing agonists
(9, 40). ‘9087 interacts with α2AAR mainly
through van der Waals and aromatic interactions
with transmembrane helices (TM) 3, 5,
6, and 7 and I205^45.52 of extracellular loop
2 (ECL2). It also forms an ionic interaction
with the conserved D128^3.32, and although this
interaction is relatively distant at 3.6 Å, it is
similar to those of norepinephrine (40) and
dexmedetomidine (9), which are 3.0 and 3.7 Å
from D128^3.32, respectively (Fig. 2B). As in the
docking prediction, the basic, formally cationic
nitrogen of ‘9087 is not oriented toward
D128^3.32 to form a salt bridge (Fig. 2B and fig.
S1D), as seen in norepinephrine, but instead
hydrogen bonds with the backbone carbonyl
of F427^7.39, as it does in dexmedetomidine
(9). The bridging exocyclic and formally neutral
amine of ‘9087 ion pairs with D128^3.32.
Typically for aminergic G protein-coupled receptors
(GPCRs), the conserved hydrogen bond
with D128^3.32 would be made by the stronger
base (9, 17, 20, 40). In fact, the formal charge of
‘9087 after protonation of the pyridine moiety
is almost equally shared between the two
nitrogens, as calculated by semiempirical quantum
mechanics and as reflected in the docking
model (fig. S11). For ‘4622, the docked pose is
also in good agreement with the cryo-EM result
with an all-atom RMSD of 1.14 Å; ‘4622 forms
a 3.4-Å hydrogen bond to D128^3.32 and makes
several hydrophobic interactions (Fig. 2D).
Both ‘4622- and ‘9087-bound structures have
receptor-GoA interfaces similar to those of other
ligand-bound α2AR-G protein complexes (figs. S9
and S10).
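
As a rough illustration of the pose comparison above, an all-atom RMSD between a docked and an experimental ligand pose can be sketched as follows. The coordinates are invented toy values, not the ‘9087 poses, and the function name `all_atom_rmsd` is an assumption of this sketch. It presumes that both poses sit in the same receptor frame with identical atom ordering and that the ligand is not re-fitted.

```python
import math

def all_atom_rmsd(pose_a, pose_b):
    """RMSD (in the input units) between two poses with matching atom order.

    No superposition is performed: the poses are compared in place, which is
    the usual convention when judging a docked pose against a structure.
    """
    if len(pose_a) != len(pose_b) or not pose_a:
        raise ValueError("poses must be non-empty and have the same atom count")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(pose_a, pose_b):
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(pose_a))

# Hypothetical three-atom ligand, coordinates in angstroms.
docked_pose = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
cryo_em_pose = [(1.0, 0.0, 0.0), (2.5, 0.0, 0.0), (2.5, 1.5, 0.0)]

print(f"all-atom RMSD: {all_atom_rmsd(docked_pose, cryo_em_pose):.2f} A")
```

In this toy case every atom is displaced by 1 Å along x, so the RMSD is 1 Å; symmetry-equivalent atom mappings, which a production tool would need to handle, are ignored here.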

The interactions observed in the ‘9087 and
‘4622 receptor complexes, and in the modeled
pose of analog ‘7075, were tested by residue
substitution for effects on Gi activation and
β-arrestin recruitment (figs. S12 and S13 and
tables S6 and S7). Consistent with the observed
ion pair with D128^3.32, norepinephrine
and dexmedetomidine are highly sensitive
to substitutions of D128^3.32, with an almost
complete loss of Gi activation and β-arrestin
recruitment. For dexmedetomidine, the Gi
EC50 is 170,000-fold higher for activation of
D128^3.32A. Although the Gi activity of ‘9087
and ‘7075 is also diminished in the D128^3.32
mutant receptors, potency only falls by ~200-
to 1600-fold. By contrast, the Gi activity of
‘9087, ‘7075, and ‘4622 is eliminated in the
F427^7.39A mutant. The backbone carbonyl of
F427^7.39 hydrogen bonds with ‘9087, whereas
its aromatic side chain stacks with the agonist
in the cryo-EM structure, perhaps indicating
formation of a cation-pi interaction between
the pyridine of ‘9087 and F427^7.39, as previously
suggested for agonist-induced α2AR
activation (47). Meanwhile, dexmedetomidine
and especially norepinephrine, which
lack these interactions, are less affected by this
mutant (Fig. 2B). Mutations of Y409^6.55 greatly
affect norepinephrine, increasing (weakening)
Gi EC50 values 500- to 10,000-fold, likely disrupting
a key hydrogen bond (40, 41); the importance
of position 6.55 was previously observed
in agonist-induced βAR activation (42-44). The
potencies of ‘9087 and dexmedetomidine are
only modestly worse in Y409^6.55 mutants, and
for ‘9087, the Gi Emax is even slightly increased.
For ‘4622 and ‘7075, most substitutions
diminished activity, with the exception of
S215^5.42A, which slightly increased the agonist
activity of ‘7075 and ‘4622 in the Gi activation
and β-arrestin recruitment assays and
hardly influenced ‘9087 and dexmedetomidine.
By contrast, S215^5.42A negatively affected
norepinephrine-induced receptor activation,
consistent with previous studies on direct interactions
of full agonists and S215^5.42 (40, 41). The
Y431^7.43A and F mutations overall influenced
β-arrestin recruitment of norepinephrine more
than Gi signaling. This has been previously
observed, leading to the proposal that direct
hydrogen bonding between the agonist and
the residue at position 7.43 could more tightly
couple TM7 and thereby play a role in β-arrestin
signaling (40). Taken together, the differential
responses to the residue substitutions support
suggestions from the structures that the new
agonists, although binding in the same overall
site as the canonical agonists, interact in meaningfully
different ways, with potential implications
for differential receptor signaling.


Optimization of docking hit ‘9087 


To optimize ‘9087, we adopted two strategies. 
First, we used classic medicinal chemistry 
hypothesis-testing and generation of analogs 
to investigate and improve key recognition 
features. We looked for possible analogs by 
similarity-searching among 1.4 billion and 
12 billion tangible molecules using the Arthor 
and SmallWorld programs (12) (NextMove 
Software, UK). From these searches, we docked 
prioritized subsets, leading to 13 of the 19 ‘9087 
analogs that we ultimately selected (Fig. 3A and 
table S5). Another six analogs were designed 
to probe particular protein-ligand interac- 
tions. Analogs were also investigated around 
compounds ‘2998, ‘4622, and ‘0172 (table S5). 
In the ‘9087 series, perturbing the pyridine
eliminated activity and confirmed the importance
of the cationic character and the importance
of the hydrogen bond with F427^7.39 (Fig.
3A, figs. S14 and S15, and table S8).

The most potent analogs emerged from
variations of the isoquinoline ring in ‘9087.
Proximal hydrophobic residues F405^6.51, F406^6.52,
Y409^6.55, and I205^45.52 do not make obvious polar
interactions with the isoquinoline nitrogen of
‘9087 (Fig. 2B). Accordingly, we added small
nonpolar groups, like the chlorine in ‘1718, or
changed the isoquinoline to a different bicyclic
system, like the benzothiophene in ‘4914 or
naphthalene in ‘5879. A set of analogs also
had the isoquinoline-to-naphthalene change
but with a single substituent added at two
different positions of the naphthalene, as in
‘4825 and PS83. Overall, this set of analogs
resulted in five potent agonists (EC50 4.1 nM
to 18 nM) (data supporting the ‘9087 optimization
are summarized in Fig. 3, A and B;
figs. S14 and S15; and tables S2, S5, and S8).
‘7075 was the most potent full agonist in the
‘9087 series with 13-fold increased potency in
the BRET Gi activation (EC50 4.1 nM, Emax 93%)
and cAMP assays (EC50 18 nM, Emax 96%) (Fig.
3B, fig. S4, and table S2). It preferentially
activated Gi/o/z signaling over other G protein
subtypes and β-arrestins and caused no receptor
internalization (figs. S6 to S8 and tables
S3 and S4).

Our second strategy for ligand optimization
was purely structure based, using the newly
determined ‘9087-α2AAR complex. Molecules
were prioritized for their favorable docked
pose in a ligand-free version of the ‘9087-α2AAR
structure or were designed to improve protein-ligand
interactions based on the ‘9087-α2AAR
cryo-EM structure, leading us to synthesize
eight further analogs. Two derivatives of ‘9087
(PS84 and PS86) confirm the importance of
the lipophilic and aromatic properties of the
bicyclic moiety for α2AAR binding and activation,
facilitating favorable interactions with
the aromatic residues F405^6.51, F406^6.52, and
Y409^6.55 (Fig. 2B). Other substitutions confirmed
the importance of the ion pair to D128^3.32 and of
the interaction between the protonated pyridine
of ‘9087 and the backbone carbonyl atom of
F427^7.39 (PS92, PS93), as also shown by the
mutational analysis (figs. S12 and S13 and
tables S6 and S7).

Seeking more potent analogs, we focused
on derivatizing the lipophilic substituents of
‘9087, building off analogs ‘5879 and ‘7075.
Assuming the same binding mode for the
naphthalene derivative ‘5879 as for ‘9087 in
the cryo-EM structure, unexploited space between
the ligand and the receptor in the
orthosteric site was revealed in positions 5
and 7 of the bicyclic moiety of ‘5879 (R^1 and
R^2 in Fig. 3A). We first tried to fill available
space with substituents of different sizes at
the R^1 position of ‘5879. From largest to smallest,
substituents were methoxy (PS71), chlorine
(PS75), and fluorine (PS70). The potency of
PS75 for Gi activation was similar to that of
‘5879 (EC50 values of 4.8 nM and 6.1 nM, respectively),
PS70 (EC50 36 nM) did not improve
activity, and PS71 had 16-fold worse
activity compared with ‘5879; for PS71, this
may reflect entropic and desolvation penalties
and a repulsive interaction for the bulky
methoxy substituent that the receptor was
unable to accommodate. With a substituent at
the R^2 position of the bicyclic moiety, as in
‘7075, the methyl analog PS83 had an
EC50 of 13 nM, comparable to those of ‘7075 and ‘5879
(EC50 values of 4.1 nM and 6.1 nM, respectively).
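
The fold changes quoted above are simple ratios of EC50 values. The sketch below uses only the Gi activation EC50 values stated in the text and back-calculates what the quoted fold changes imply; the helper names are hypothetical, and the implied values are arithmetic consequences of the text, not additional measurements.

```python
import math

# Gi activation EC50 values (nM) as quoted in the text above.
EC50_NM = {"7075": 4.1, "PS75": 4.8, "5879": 6.1, "PS83": 13.0, "PS70": 36.0}

def fold_change(ec50_test_nm, ec50_ref_nm):
    """Ratio of EC50 values; > 1 means the test compound is less potent."""
    return ec50_test_nm / ec50_ref_nm

def pec50(ec50_nm):
    """Negative log10 of the EC50 expressed in mol/L."""
    return -math.log10(ec50_nm * 1e-9)

# '7075 is described as 13-fold more potent than '9087.
implied_9087_nm = 13 * EC50_NM["7075"]
# PS71 is described as 16-fold weaker than '5879.
implied_ps71_nm = 16 * EC50_NM["5879"]
# PS70 versus '5879, both values quoted directly.
ps70_vs_5879 = fold_change(EC50_NM["PS70"], EC50_NM["5879"])

print(f"implied '9087 EC50: about {implied_9087_nm:.0f} nM")
print(f"implied PS71 EC50: about {implied_ps71_nm:.0f} nM")
print(f"PS70 is {ps70_vs_5879:.1f}-fold weaker than '5879")
print(f"pEC50 of '7075: {pec50(EC50_NM['7075']):.2f}")
```

Working on the pEC50 (log) scale is a common design choice because fold changes then become simple differences.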

PS75 was the most potent analog to arise
from the second round of optimization. The
molecule was a full agonist with 11-fold improved
potency (EC50 4.8 nM, Emax 82%) for Gi
activation compared with ‘9087, and it more
potently activated Gi/o/z subtypes than ‘9087.
Meanwhile, PS75 retained the preferential
signaling through the Gi/o/z versus other G
protein families and β-arrestins and again led
to negligible receptor internalization (figs. S5
to S8 and tables S3 and S4). In the modeled
pose of PS75, the chlorine substituent is
oriented toward the open space below its
naphthalene ring toward T133^3.37 (Fig. 3C).
Its potency and efficacy make PS75 a lead
molecule for the treatment of pain.


Newly discovered α2AAR agonists are
analgesic with reduced side effects 


In preparation for in vivo studies, we inves- 
tigated the selectivity and pharmacokinetic 
properties of our most potent agonists. ‘9087 
activated only a few of the 320 GPCRs screened 
(45) (fig. S16A). Only the dopamine D2 receptor
(D2R) had weak activity in secondary assays,
with EC50 values of 4.5 µM and 16 µM in G
protein signaling and β-arrestin recruitment,
respectively (fig. S16B). ‘9087 did not measurably
activate the µ-opioid receptor (µOR), nor
did it inhibit the human ether-a-go-go-related
gene (hERG) at concentrations below 10 µM
(fig. S16, C and D). In binding experiments to
other adrenergic receptors, ‘9087 bound to the
α2C-subtype at mid-nanomolar concentration
and to other α-subtypes in the 1-to-10-µM range
(table S9). The molecule had no measurable
binding for β-adrenergic receptors up to 10 µM.
Against the imidazoline-2 receptor (I2R), a
common off-target of α2AAR agonists, ‘9087
bound with a Ki of 300 nM, showing a modest
sixfold selectivity for the α2AAR, whereas a few
docking-derived compounds actually had higher
affinities for I2R than for the α2AAR (fig. S16E).

Computational models suggested that ‘9087,
‘4622, ‘7075, and ‘2998 would all have good
physiologic permeability, consistent with their
small size, low topological polar surface area,
and weakly basic character (table S10). Consistent
with this prediction, on 10-mg/kg intraperitoneal
(i.p.) injection in mice, the first
three compounds, especially, had high brain
and cerebrospinal fluid (CSF) exposure, indicating
that the compounds are likely to reach
centrally acting α2AARs (table S10). ‘9087
reached a similar maximum concentration
(Cmax) in the CSF as did ‘7075, both of which
were fourfold greater than the Cmax of PS75,
and ‘9087 had a 12- to 20-fold higher area
under the concentration-time curve (AUC) in
the CSF compared with either ‘7075 or PS75;
CSF concentrations are often used as a proxy
for fraction unbound in the brain (46). Encouragingly,
‘9087 reached high brain exposure
after both intravenous (i.v.) and oral
(p.o.) administration, with AUC values of
420,000 ng min mL^-1 and 2,540,000 ng min
mL^-1, respectively (table S10). The oral bioavailability was
higher than 100%, which may reflect metabolic
saturation at nonequal i.v. and p.o. doses, or
enterohepatic recirculation (table S10); this
merits further investigation. PS75 was fully bioavailable,
though, as an analog of ‘9087, the
same caveats apply. We also investigated the
metabolic stability of ‘9087, ‘7075, and PS75
in male rat liver microsomes. All three compounds
remained largely unmodified after 1 hour,
with ‘9087 having lower clearance and a longer
half-life than its two analogs (fig. S17).

Given their selectivity and high brain ex- 
posures, we tested the more potent agonists 
for pain relief after systemic dosing (Fig. 4, A
to G). Initial doses were chosen to be less than 
the 10 mg/kg dose used in pharmacokinetics 
studies owing to favorable CSF and brain 
properties. We started with ‘9087, the initial 
docking hit, and evaluated analogs as they 
emerged from compound optimization. With 
naive (uninjured) mice, ‘9087 did not increase 
baseline mechanical withdrawal thresholds, 
something observed with many antipain medi- 
cations, which often only have an antinoci- 
ceptive effect in the presence of pain. We then 
investigated the activity of ‘9087 in a mouse 
model of neuropathic pain, in which partial 
peripheral nerve injury invokes profound me- 
chanical hypersensitivity (47). Systemic subcuta- 
neous (s.c.) injections of ‘9087 dose-dependently 
increased the mechanical thresholds of spared 
nerve injury (SNI) mice, with a sharp increase in 
activity from 3 mg/kg to 5 mg/kg, at which point 
the effects plateaued (Fig. 4A). Lower doses were 
antiallodynic, returning mechanical thresholds 
to preinjury levels, whereas the higher doses 
were genuinely analgesic, generating mechan- 
ical thresholds substantially higher than 
baseline, preinjury levels. ‘9087 also increased 
thermal latencies in the complete Freund’s 
adjuvant (CFA)-mediated inflammatory pain 
model, which indicates that the molecule is 
effective in both tissue and nerve injury-induced 
pain models (Fig. 4G). ‘9087 also increased 
withdrawal latencies in the hot plate (55°C) 
and tail flick (50°C) assays of acute thermal 
(heat) pain (Fig. 4, E and F). Consistent with 
its relatively high exposure on oral dosing, 
this molecule also conferred a dose-dependent 
antiallodynic effect when delivered orally in 
the SNI neuropathic pain model (Fig. 4A). 
Doses up to 20 mg/kg of ‘9087 did not reduce 
the ability of the mice to perform in the 
rotarod test, which contrasts with the complete 
sedation of a dexmedetomidine dose of 60 µg/kg
(Fig. 4H). This finding is an important differen- 
tiator for the new series and indicates that the 
analgesic effects of ‘9087 are not the result of 
motor impairment. 

We also investigated the mechanistic bases
for the analgesia of the new α2AAR agonists,
both pharmacologically and genetically. Pharmacologically,
the analgesic effect of ‘9087 was
reversed by a systemic injection of the well-known
α2AR antagonist, atipamezole (2 mg/kg;
administered 15 min before ‘9087) (Fig.
4C). Because atipamezole has broad activity
against the α2AR receptor subtypes and imidazoline
receptors (48), we also tested ‘9087 in
mice that express an inactive form of the α2AAR
(point mutation D79N) (5, 49-51). D79N mutant
mice were tested in the tail flick (50°C) assays.
As previously reported, dexmedetomidine no
longer induced analgesia in the mutant mice
(52), and, as a control, the analgesia conferred
by a 10-mg/kg dose of morphine was not significantly
altered by the mutation (Fig. 4D).
Consistent with activity through α2AAR, the
analgesia conferred by ‘9087 was reduced by
>50% back to baseline in the D79N mutant
mice (Fig. 4D). Although most of the antinociceptive
activity appears to derive from activity
at the α2AAR, we cannot discount contributions
from other receptors.

Five other new, docking-derived α2AAR
agonists (‘2998, ‘4622, ‘0172, ‘7075, and PS75)
also exhibited antiallodynic effects in the SNI
mice (Fig. 4, A and B). The ‘9087 analogs, ‘7075
and PS75, completely reversed the mechanical
hypersensitivity in the neuropathic pain model,
with PS75 being more effective than ‘9087 (Fig.
4A). In contrast to ‘9087, PS75 did increase
the mechanical thresholds of naive (uninjured)
mice (Fig. 4A). The antiallodynic effects of ‘4622
and ‘7075 were reversed by atipamezole (48);
although this antagonist also partially reversed
the antiallodynia of PS75, ‘2998, and ‘0172, these
effects did not reach statistical significance at
the small numbers of mice tested (Fig. 4C).
PS75 also increased withdrawal latencies in
the tail flick (50°C) acute thermal pain assay,
and when tested in the D79N mutant mice,
its analgesic effect was reduced by >50% (Fig.
4, D and E). Compounds ‘0172 and ‘4622 also
exhibited antihyperalgesic effects in the CFA
inflammatory pain model (Fig. 4G); ‘2998 did
not, which may reflect the reduced brain
penetration of this molecule (table S10). Only
‘4622 caused slight motor impairment at its
equianalgesic dose in the rotarod test; however,
the effect did not reach the full sedation
observed with dexmedetomidine (Fig. 4H). As
with ‘9087, increased dosing up to 20 mg/kg
PS75 did not have an effect on the rotarod test
(Fig. 4H). Taken together, these pharmacological
and chemical-genetic epistasis experiments
support a mechanism of action primarily through
the α2AAR, though a lesser contribution
of other α2AR subtypes cannot be ruled out.

Some α2AAR agonists can produce changes in
feeding, weight gain, and hyperglycemia (53, 54)
as side effects. Accordingly, we evaluated the
effect of compound treatment on body weight
>48 hours after injection while allowing the
mice to freely feed. We found no effect on body
weight for ‘9087 dosed at 5 mg/kg, 10 mg/kg,
or 20 mg/kg, nor for dexmedetomidine dosed
at 30 µg/kg (fig. S18). We also investigated
whether ‘9087 induced constipation, a side
effect well-known for opioids and other classes
of analgesics, comparing it with dexmedetomidine
and with morphine tested at analgesic
doses (30 µg/kg and 10 mg/kg, respectively).
The number of accumulated pellets over 6 hours
after vehicle or compound injection was measured.
As expected, morphine induced constipation
when compared with vehicle at the 1-,
2-, and 3-hour marks. By contrast, although
pellet number did decrease modestly with
‘9087 and dexmedetomidine, neither effect
differed significantly from vehicle (fig. S18).
We recognize that other possible side effects
remain untested in this study; we return to
these in the Discussion section.

versus ‘9087; two-tailed t test; ***P < 0.001). (G) Efficacy of newly characterized agonists in
complete sedation at higher doses. All compounds compared with their vehicles (20%
Kolliphor versus ‘9087, ‘0172, and ‘4622; 20% cyclodextran versus ‘2998, ‘7075, and PS75;
saline versus DEX; one-way ANOVA; *P < 0.05; **P < 0.01; ****P < 0.0001). For (A) to (G),
all compounds were administered s.c., unless otherwise indicated. Data are shown as
individual data points and means ± SEMs (n = 4 to 25).


Discussion 


Three key observations emerge from this study.
First, multiple chemotypes, unrelated to known
agonists, discovered directly from large-library
docking are efficacious in neuropathic, inflammatory,
and acute pain models through α2AAR
agonism (Fig. 4, A to F). In docking, as in other
target-based screens, the initial goal is to identify
molecules with in vitro activity; these are then
optimized for in vivo activity through extensive
structure-activity optimization (13, 14, 17, 20).
Although it may be rare that direct hits from a
docking screen are themselves in vivo active,
such activity of the direct docking hits in this
study does speak to the strengths of interrogating
vast virtual libraries (77, 12). Second,
functional assays reveal preferential Gi/o/z activation
versus other G protein subtypes, no
β-arrestin activity, and no receptor internalization
for ‘9087 and its analogs, ‘7075 and PS75,
compared with the established therapeutic
α2AAR agonists, like dexmedetomidine and
brimonidine, that also activate multiple other
G protein and β-arrestin signaling pathways
(Fig. 1C, Fig. 3B, table S2, and figs. S6 to S8).
Although the high rate of agonist discovery
was an intended outcome of docking against
the activated state of the α2BAR, the functional
selectivity was not designed and can be attributed
to the chemotypes explored and, by
extension, their use of both canonical and
noncanonical receptor interactions (Fig. 2
and figs. S12 and S13). There is no logical requirement
that new chemotypes lead to
new signaling pharmacology, though this
has often been observed for other receptors
(0, 13, 17, 25). Third, unlike dexmedetomidine
and clonidine (6, 7, 55), ‘9087 and its
analogs, PS75 and ‘7075, do not cause sedation
or motor impairment at analgesic doses,
potentially enabling broader applications to
pain treatment and attesting to the ability to
differentiate these two effects with α2AAR
agonists (Fig. 4H).

The new α2AAR chemotypes explored (Fig.
1B) reflect the size and diversity of the docked 
libraries. Most of the actives emerged from the 
fragment-like library in ZINC, which covers a 


Fink et al., Science 377, eabn7065 (2022) 


much greater portion of the chemical universe 
in its size range than do the lead-like or drug- 
like libraries. This is akin to physical fragment 
libraries, which typically might include ~1500 
molecules (56) but are thought to cover more 
chemotypes than high-throughput screening 
libraries that are 1000-fold larger. Meanwhile, 
the virtual fragment library in ZINC enumerates
more than 20 million molecules (11, 12),
~10,000-fold more than in most physical fragment
libraries. With more than 800,000 Bemis-Murcko
scaffolds (78), there are 500-fold more
fragment scaffolds in the docked library than
there are molecules in most physical fragment
libraries. From this great chemotype diversity
spring opportunities for ligands with distinct
pharmacology.
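
The fold differences in this comparison follow from the library sizes given in the text, as the short check below shows. The constant names are assumptions of this sketch, and the physical library size uses the "~1500 molecules" figure quoted above.

```python
# Library sizes as quoted in the text (lower bounds for the virtual library).
PHYSICAL_FRAGMENT_LIBRARY = 1_500      # typical physical fragment library
VIRTUAL_FRAGMENTS = 20_000_000         # ZINC fragment-like molecules docked
BEMIS_MURCKO_SCAFFOLDS = 800_000       # distinct scaffolds among them

# Virtual molecules per physical-library molecule: order of 10,000-fold.
molecule_ratio = VIRTUAL_FRAGMENTS / PHYSICAL_FRAGMENT_LIBRARY
# Virtual scaffolds per physical-library molecule: order of 500-fold.
scaffold_ratio = BEMIS_MURCKO_SCAFFOLDS / PHYSICAL_FRAGMENT_LIBRARY

print(f"molecules: about {molecule_ratio:,.0f}-fold more")
print(f"scaffolds: about {scaffold_ratio:,.0f}-fold more")
```

Both ratios round to the order-of-magnitude figures used in the paragraph above.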

How the new chemotypes confer new pharmacology
is uncertain. From the cryo-EM complex
with ‘9087 and substitution of binding
site residues, ‘9087 and ‘7075 appear to make
weaker interactions with D128^3.32 and an
apparently stronger interaction with F427^7.39
compared with the canonical agonists (Fig. 2B
and figs. S12 and S13). These structural differences
may contribute to the unusual functional
Gi/o/z selectivity over other G protein
subtypes and β-arrestins as compared with
known agonists, and to the lack of receptor
internalization (Fig. 3B and figs. S6 to S8).
In turn, this unusual signaling may play a
role in conferring analgesia without sedation
(Fig. 4). Admittedly, as the engagement of
transducing G proteins and β-arrestins occurs
35 Å away from the orthosteric site, other
mechanisms may be involved (57). Moreover,
the physiological impact of the selective signaling
will be entangled with the pharmacokinetics
of the molecules. Regardless, what
should be clear is that the analgesic potential
of α2AAR agonists may be disentangled from
their sedative effect, which is important for
future drug development.

Several cautions merit re-emphasis. ‘9087
and its analogs are not as potent as dexmedetomidine.
The action of ‘9087 in vivo is blocked
by the α2AR antagonist atipamezole (48) and
much reduced in D79N α2AAR mice, which
indicates that the α2AAR is the primary receptor
mediating activity in vivo (Fig. 4, C and
D). However, especially for the mutant mice,
we note that although most efficacy above
baseline was reduced, it was not fully reversed
(Fig. 4, C and D), and other targets may also
play a role, including the I2R and other α2AR
subtypes. We also have not extensively evaluated
side effects common for α2AAR agonists,
especially cardiovascular effects mediated by
α2AAR and α2BAR activation (6). Finally, we
do not anticipate that the ability to translate
directly from docking hits to in vivo activity,
as we saw in this study, will be common. In
this case, it was helped by the small size of
the α2BAR and α2AAR orthosteric sites and


30 September 2022 


correspondingly the high potencies and good 
physical properties of the docking hits (fig. S1 
and tables S2 and S10). 

These caveats should not obscure the key
observations of this study. From an ultralarge
library docking screen emerged low-nanomolar
α2AAR partial agonists, topologically unrelated
to previously known ligands, making previously
unobserved interactions with the receptor
that appear to confer new pharmacology
(Fig. 1; Fig. 2B; table S2; and figs. S6 to S8, S12,
and S13). Several of the new α2AAR agonists
were antiallodynic and analgesic in neuropathic
and inflammatory pain models and
against acute nociception in naive animals
(Fig. 4, A to G). Among the most promising
are ‘9087 and PS75, both of which are strongly
analgesic without the sedative effects of dexmedetomidine
(Fig. 4) and are orally bioavailable
(table S10). These properties make
the compounds plausible therapeutic leads for
new nonopioid pain therapeutics without the
sedation of classic α2AAR drugs.


Materials and methods 
Molecular docking 


The α2BAR receptor with dexmedetomidine
and GoA [Protein Data Bank (PDB) ID: 6K41]
(9) was used for docking calculations before
the determination of the α2AAR dexmedetomidine-bound
structure (PDB 7EJA; comparison
shown in fig. S1). Three screens of the
ZINC15 database were run, two for fragment
molecules (<250 amu, cLogP < 3.5) and one
for lead-like molecules (250 to 350 amu, cLogP < 3.5).
Docking was performed with DOCK3.7 (26).
For the first screen, 45 matching spheres
(26) were used, 15 from the docked pose of
dexmedetomidine and 30 from SPHGEN-generated
spheres (58). The receptor structure
was protonated using REDUCE (59), and
AMBER united atom charges were assigned
(60). In control calculations (67) with 15 known
agonists from the IUPHAR-BPS database
(27) and from the literature (29-31), balanced
against 1800 property-matched decoys (62),
docking parameters were optimized based on
adjusted logAUC (67) and on recapitulation
of ligand interactions with residues
α2BAR D92^3.32, F412^7.39, F387^6.51, Y391^6.55, and
F388^6.52 (residues conserved in α2AAR: D128^3.32,
F427^7.39, F405^6.51, Y409^6.55, and F406^6.52). An
“extrema” set was used to evaluate cationic
charge preference, as described (18, 62). The
protein low dielectric and desolvation regions,
defined by pseudoatoms calculated
with SPHGEN (58), were extended as previously
described (63), based on the control
calculations, by a radius of 1 and 0.3 Å, respectively
(10, 64). Energy potential grids were
calculated using CHEMGRID for AMBER-based
van der Waals potential, QNIFFT (65) for
Poisson-Boltzmann-based electrostatic potentials,
and SOLVMAP (66) for context-dependent
ligand desolvation. In the second and third
docking screens, differences included modified
matching spheres (rigid fragments of the
xylazine docked pose were added, and only 40
matching spheres were used) and extension of
the desolvation pseudoatoms by a radius of 0.2 Å.

For the first screen, 20 million molecules
from the ZINC15 (https://zinc15.docking.org/)
fragments subset were docked in 3008 core
hours or ~6 wall-clock hours on a 500-core
cluster. Almost 5 trillion complexes were sampled;
on average, each molecule sampled 2405
orientations and 202 conformations. Only ~8 million
of the 20 million could be sterically accommodated
in the orthosteric site, reflecting its small size.
For the second screen, the same 20 million
fragments were docked in 3830 core hours or
7.7 hours on a 500-core cluster, sampling more
than 6 trillion complexes; on average, each
molecule sampled 3122 orientations and 203
conformations. About 9 million molecules were
accommodated in the site. For the third screen,
281 million molecules from the ZINC15 lead-like
subset were screened in 71,625 core hours or
~1 week on 500 cores. More than 222 trillion
complexes were sampled with an average of
4553 orientations and 469 conformations per
molecule, though ultimately only 13.5 million
could sterically fit in the site.
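
The wall-clock times quoted above are consistent with dividing core hours by the number of cores, as the sketch below illustrates. It assumes ideal scaling across a fully available 500-core cluster, which is a simplification made for this illustration; the function and dictionary names are hypothetical.

```python
CLUSTER_CORES = 500

# Core hours per screen, as quoted in the text.
CORE_HOURS = {
    "first fragment screen": 3_008,
    "second fragment screen": 3_830,
    "lead-like screen": 71_625,
}

def wall_clock_hours(core_hours, cores=CLUSTER_CORES):
    """Ideal wall-clock time, assuming perfect scaling over all cores."""
    return core_hours / cores

for screen, hours in CORE_HOURS.items():
    wall = wall_clock_hours(hours)
    print(f"{screen}: {wall:.1f} h ({wall / 24:.1f} days)")
```

Under this assumption the three screens take about 6.0 hours, 7.7 hours, and 6.0 days, in line with the "~6 wall-clock hours", "7.7 hours", and "~1 week" stated above.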

For the first and second screens, the top 
161,055 scored compounds were clustered by 
ECFP4-based Tanimoto coefficient (Tc) of 
0.5 to identify distinct chemotypes, resulting 
in 37,150 and 33,378 clusters. For the third 
screen, the top 300,000 scored compounds 
were clustered in the same manner, resulting 
in 57,168 clusters. Molecules were filtered for 
novelty, removing those with Tc > 0.35 to 15 
d2aAR agonists used in control calculations. 
The top 5000 ranked molecules remaining 
were visually filtered for interactions at a.,AR 
residues D92°°?, F4127°9, F387%°!, y391%°, 
and F388° "for the first and second screens; 
for the third screen, the top 20,000 molecules 
were examined by the same criteria. Lastly, 
prioritized molecules were also filtered for 
internal torsional strain; this was done visu- 
ally for the first screen, whereas the second 
and third screens used a method drawing on 
CSD torsion populations cutting off at a total 
energy of 2 torsion energy units (32). An ad- 
ditional novelty filter was performed remov- 
ing molecules with TC > 0.35 to CHEMBL29 
(28) o2,AR compounds. Sixty-four molecules 
were selected for purchasing: 33, 26, and 5 from 
the first, second, and third screens, respectively. 
Ten were sourced from WuXi and another 
54 from Enamine, of which 8 and 40 were 
successfully synthesized, respectively. To the
best of our knowledge, most of these compounds
have not been synthesized before, except for
some of the smaller fragments, which have
previously been used as building blocks
(table S5).
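
The novelty filter described above can be sketched as follows. This is a minimal illustration that represents each fingerprint as a set of on-bit indices; the toy fingerprints and function names are hypothetical, and real ECFP4 fingerprints would come from a cheminformatics toolkit rather than being written by hand.

```python
from typing import Iterable, Set

def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto coefficient between two fingerprints given as sets of on-bits."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)

def is_novel(candidate: Set[int], known: Iterable[Set[int]], cutoff: float = 0.35) -> bool:
    """True if no known ligand has Tc > cutoff to the candidate."""
    return all(tanimoto(candidate, k) <= cutoff for k in known)

# Hypothetical toy fingerprints standing in for known agonists.
known_agonists = [{1, 2, 3, 4}, {10, 11, 12}]
close_analog = {1, 2, 3, 5}    # shares 3 of 5 distinct bits with the first agonist
distinct = {1, 20, 21, 22}     # shares 1 of 7 distinct bits with the first agonist

print(is_novel(close_analog, known_agonists))
print(is_novel(distinct, known_agonists))
```

The same coefficient, with a cutoff of 0.5 instead of 0.35, underlies the clustering step, although the clustering algorithm itself is not shown here.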

Synthesis of tangible molecules

Forty-eight molecules prioritized for purchasing
were synthesized by Enamine and WuXi,
for a total fulfilment rate of 75%. Compounds
were sourced from the Enamine REAL database
(https://enamine.net/compound-collections/real-compounds)
or the WuXi GalaXi virtual
library. The purities of active molecules synthesized
by Enamine and WuXi were at least 90%
and typically above 95%. For bespoke compounds
synthesized in house, purities were at least 96%
and typically above 99%. The purity of compounds
tested in vivo was >95% and typically
above 98% (table S5 and figs. S19 and S20).


Ligand optimization 

Analogs for four docking hits (‘9087, ‘2998,
‘0172, and ‘4622) were queried in Arthor and
SmallWorld against the 1.4 billion and 12 billion
tangible libraries (https://sw.docking.org/,
https://arthor.docking.org), the latter primarily
containing Enamine REAL Space compounds
(https://enamine.net/compound-collections/real-compounds/real-space-navigator).
Results from SmallWorld, Bemis-Murcko framework,
and substructure queries were pooled and docked
into the α2BAR site before the ‘9087-α2AAR
structure was determined.
Compounds with favorable interactions in the
orthosteric site were prioritized, leading to 13
analogs for ‘9087 (table S5). Also, for the four
docking hits, analogs were designed by modifying
the two-dimensional (2D) chemical
structure to test specific hypotheses, adding
another six analogs for ‘9087 (table S5). The
second round of analogs for ‘9087 was designed
and prioritized for bespoke synthesis.
Some were docked to a preliminary cryo-EM
model of the ‘9087-α2AAR structure, whereas
several were designed and synthesized regardless
of docked pose to test specific hypotheses;
in total, eight of these were synthesized and
tested (table S5). Contact areas were calculated
with UCSF Chimera
(https://rbvi-ucsf.edu/chimera).


Bespoke synthesis 


‘7075 was designed for hypothesis testing and
was bespoke synthesized by Enamine (fig. S19).


Molecular modeling of ‘7075 and PS75 


Maestro (version 2019-4, Schrödinger, Inc.)
was used to manually change the chemical
structure of ‘9087 to ‘7075 or PS75 in a
preliminary model of the ‘9087-α2AAR complex
cryo-EM structure. The isoquinoline nitrogen
was changed to a carbon and the fluorine or
chlorine substituent was added to the naphthalene
ring for ‘7075 and PS75, respectively.
The resulting complex of ‘7075 or PS75 and
α2AAR coupled to the G protein but without
scFv16 was energy minimized following the
Protein Preparation Wizard protocol using the
OPLS3e force field. The maximum heavy atom
deviation from the initial model was 0.3 Å.


Passive membrane permeability prediction

Ligand structures were converted from SMILES
strings to 3D structures using LigPrep (version
53013, Schrödinger, Inc.). For the passive membrane
permeability prediction (67, 68), we retained
only the neutral form of each ligand. Passive
membrane permeability of a ligand is predicted
from the free energy of insertion (ΔG), i.e., from
the energy difference between a conformer in
low- and high-dielectric media. Therefore, we
generated conformations of each ligand using
ConfGen software (version 5.1, Schrödinger, Inc.).
We minimized each conformer in a low-dielectric
medium (chloroform) to mimic the membrane
dielectric using the Protein Local Optimization
Program (PLOP) (69). After finding the lowest-energy
conformer in the low-dielectric medium,
we calculated the energy of that energy-minimized
conformer in water. We subtracted
the energy of the ligand in high-dielectric
water from its energy in the low-dielectric
medium. We further added a deionization penalty
term to account for transforming the ionized form
of the ligand in water to its neutral form in the
membrane. We computed the deionization penalty
energy using the empirical pKa (where Ka is
the acid dissociation constant) prediction software
Epik (version 5.1013, Schrödinger, Inc.).
We rank-ordered the ligands based on their
free energy of insertion.
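
The bookkeeping of this calculation can be sketched as below. This is only an outline of the arithmetic described above, not the PLOP or Epik implementation: the `Conformer` class, the function name, the energy units, and all numbers are hypothetical.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Conformer:
    energy_low_dielectric: float  # energy after minimization in the chloroform-like medium
    energy_water: float           # energy of the same minimized geometry in water

def insertion_free_energy(conformers: List[Conformer], deionization_penalty: float) -> float:
    """Energy in the low-dielectric medium minus energy in water, plus the penalty."""
    # Pick the lowest-energy conformer in the low-dielectric medium.
    best = min(conformers, key=lambda c: c.energy_low_dielectric)
    return best.energy_low_dielectric - best.energy_water + deionization_penalty

# Hypothetical energies for two conformers of one ligand (arbitrary units).
ligand = [Conformer(-10.0, -18.0), Conformer(-12.0, -19.0)]
delta_g = insertion_free_energy(ligand, deionization_penalty=1.5)
print(delta_g)
```

Ligands would then be sorted by this value to obtain the rank order.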


Radioligand binding experiments 


Receptor binding affinities for α2AAR and
α2BAR as well as the related adrenergic
subtypes α1A, α1B, α2C, β1, and β2
were determined as described previously
(44, 70). In brief, membranes were prepared
from HEK293T cells transiently transfected
with the cDNA for human α2AAR, murine
α2AAR (provided by D. Calebiro, Birmingham,
UK), human α2BAR (obtained from the cDNA
Resource Center, www.cdna.org), or with the
cDNAs for the human α1A, α1B, α2C, β2 (all from
the cDNA Resource Center), and β1 (provided by
R. Sunahara, UCSD). Receptor densities (Bmax
value) and specific binding affinities (KD value)
for the radioligand [3H]RX821002 (specific
activity 52 Ci/mmol, Novandi, Södertälje,
Sweden) were determined as 1400 ± 210 fmol/mg
protein and 0.54 ± 0.024 nM for human
α2AAR, 4000 ± 720 fmol/mg protein and
1.8 ± 0.61 nM for murine α2AAR, and 3400 ±
580 fmol/mg protein and 2.3 ± 0.52 nM for
α2BAR, respectively. Further values are 3200 ±
1900 fmol/mg protein and 0.58 ± 0.11 nM
for α2C, 2000 ± 950 fmol/mg protein and
0.70 ± 0.13 nM for α1A, and 4000 fmol/mg
protein and 0.11 nM for α1B, both determined
with [3H]prazosin (51 Ci/mmol, PerkinElmer,
Rodgau, Germany), respectively, and 1400 ±
360 fmol/mg protein and 0.070 ± 0.006 nM
for β1 and 1300 ± 230 fmol/mg protein and
0.074 ± 0.012 nM for β2, both determined with
[3H]CGP12177 (52 Ci/mmol, PerkinElmer).

Competition binding experiments with α2AR subtypes
were performed by incubating membranes
in buffer A (50 mM TRIS at pH 7.4) at final
protein concentrations of 3 to 10 µg per well
with the radioligand (final concentration 0.5
to 2.0 nM according to the appropriate KD and
Bmax) and varying concentrations of the competing
ligands for 60 min at 37°C. Binding to
α1A and α1B was measured with buffer B (60 mM
TRIS, 5 mM MgCl2, 1 mM EDTA, 100 µg/mL
bacitracin and 5 µg/mL soybean trypsin inhibitor
at pH 7.4) at 2 to 6 µg per well (radioligand
at 0.2 to 0.3 nM), and binding to β1 and β2 was
measured with buffer C (25 mM HEPES, 5 mM
MgCl2, 1 mM EDTA, and 0.006% bovine serum
albumin at pH 7.4) at 4 to 8 µg per well (radioligand
0.2 nM). Nonspecific binding was determined
in the presence of unlabeled ligand at
10 µM. Protein concentration was measured
using the method of Lowry (71).

The resulting competition curves were analyzed
by nonlinear regression using the algorithms
implemented in Prism 8.0 (GraphPad
Software, San Diego, CA) to provide half-maximal
inhibitory concentration (IC50) values, which
were subsequently transformed into Ki values
by applying the equation of Cheng and Prusoff
(72). Mean Ki values (±SEM for n ≥ 3 or ±SD
for n = 2) were derived from two to seven
experiments, each performed in triplicate.
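
For reference, the Cheng-Prusoff transformation mentioned above is Ki = IC50 / (1 + [L]/KD), where [L] is the radioligand concentration and KD its dissociation constant. The sketch below uses the KD reported above for human α2AAR; the IC50 and the radioligand concentration are hypothetical values chosen only for illustration.

```python
def cheng_prusoff_ki(ic50_nm: float, radioligand_nm: float, kd_nm: float) -> float:
    """Convert an IC50 into a Ki with the Cheng-Prusoff equation (all values in nM)."""
    return ic50_nm / (1.0 + radioligand_nm / kd_nm)

# Hypothetical IC50 and radioligand concentration; KD = 0.54 nM as quoted in the text.
ki = cheng_prusoff_ki(ic50_nm=10.0, radioligand_nm=0.54, kd_nm=0.54)
print(f"Ki = {ki:.2f} nM")
```

When the radioligand concentration equals its KD, as in this example, the Ki is half the IC50.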


Functional assays 
Plasmids 


The human wild-type (WT) α2AAR, its respective
receptor mutants (73) and the murine
α2AAR, all carrying an N-terminal HA-signal
sequence and a FLAG-tag, as well as the human
adrenergic receptor subtypes α1A, α1B, α2C, β1,
and β2 and the dopamine receptor D2long were
cloned into pCDNA3.1 for G protein activation
assays [BRET, inositol phosphate (IP) accumulation].
Human α2AAR and α2BAR were fused
to the ARMS2-PK2 sequence and cloned into
pCMV (DiscoverX, Eurofins) for β-arrestin-2
recruitment assays, respectively, using polymerase
chain reaction (PCR) and Gibson Assembly
(New England Biolabs) (70). Sequence
integrity was verified by DNA sequencing
(Eurofins Genomics).


Bioluminescence resonance 
energy transfer 


G protein activation by human α2AAR and
D2long was monitored with Gαi1-RLucII (74, 75)
together with Gβ1 and Gγ2-GFP10. Assessment
of arrestin recruitment was performed by enhanced
bystander BRET using CAAX-rGFP and
β-arrestin-2-RLucII as biosensors (39, 74) in
the presence of GRK2, as described (44, 76).
In brief, HEK293T cells (gift from the Chair
of Physiology, FAU Erlangen-Nürnberg) were
transfected with 200 ng receptor plasmid for
G protein activation (receptor:Gα:Gβ:Gγ ratio
2:0.5:1:4) or 100 ng receptor plasmid for
β-arrestin recruitment (receptor:β-arrestin:
GRK2:CAAX ratio 1:0.2:1:3) using linear
polyethyleneimine (PEI, Polysciences, 3:1 PEI:DNA
ratio). The DNA was complemented to a total
amount of 1 µg DNA per 3 × 10^5 cells with
single-stranded DNA (ssDNA) (Sigma Aldrich),
and 10,000 cells per well were transferred into
96-well half-area plates (Greiner, Frickenhausen,
Germany). Additional experiments were performed
using the same amount of G protein
plasmids as described above but 50 ng or 10 ng
α2AAR plasmid instead. Forty-eight hours after
transfection, the cell medium was exchanged for
PBS (phosphate-buffered saline), and cells were
stimulated with ligands at 37°C for 10 min.
Coelenterazine 400a (abcr GmbH, Karlsruhe,
Germany) at a final concentration of 2.5 µM
was added 5 min before measurement. BRET
was monitored on a Clariostar plate reader
(BMG, Ortenberg, Germany) with the appropriate
filter sets (donor 410/80 nm, acceptor
515/30 nm) and was calculated as the ratio of
acceptor emission to donor emission. The BRET
ratio was normalized to the effect of buffer (0%)
and the maximum effect of norepinephrine
(100%) for adrenergic receptors and quinpirole
(100%) for D2long. For each compound, 3 to
17 individual experiments were performed,
each done in duplicate.
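
The ratio and normalization steps just described can be written compactly. The sketch below is illustrative only: the function names and emission counts are hypothetical, and the plate reader software may apply additional corrections that are not shown.

```python
def bret_ratio(acceptor_emission: float, donor_emission: float) -> float:
    """BRET ratio: acceptor emission divided by donor emission."""
    return acceptor_emission / donor_emission

def normalize_percent(ratio: float, buffer_ratio: float, reference_max_ratio: float) -> float:
    """Scale a BRET ratio so that buffer = 0% and the reference agonist maximum = 100%."""
    return 100.0 * (ratio - buffer_ratio) / (reference_max_ratio - buffer_ratio)

# Hypothetical emission counts for buffer, reference agonist maximum, and a test well.
buffer_r = bret_ratio(800.0, 1000.0)
reference_r = bret_ratio(600.0, 1000.0)
sample_r = bret_ratio(700.0, 1000.0)

sample_percent = normalize_percent(sample_r, buffer_r, reference_r)
print(f"{sample_percent:.1f}% of the reference response")
```

Because both numerator and denominator carry the same sign, the formula works whether the ratio rises or falls on receptor activation.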

Surface expression of the α2AAR in the G
protein activation assays was monitored by
an enzyme-linked immunosorbent assay
(ELISA) directed against the N-terminal FLAG
tag. HEK293T cells were transfected with the
cDNAs encoding α2AAR, Gαi1-RLucII, Gβ1, Gγ2-GFP10,
and ssDNA, as described above. As a
control, cells transfected with only α2AAR or
mock pcDNA3.1 plasmid and ssDNA were used.
Immediately after transfection, 50,000 cells
per well were transferred to a 48-well plate
(Greiner) pretreated with poly-D-lysine (Sigma
Aldrich) and incubated at 37°C and 5% CO2 for
48 hours. The medium was removed, and cells were
treated with 4% paraformaldehyde for 10 min,
washed once (wash buffer: 150 mM NaCl, 25 mM
Tris, pH 7.5), and blocked for 60 min (30 g/liter
skim milk powder in wash buffer; all steps
carried out at room temperature). After incubation
with anti-FLAG M2 mouse IgG (F3165,
Sigma Aldrich, 1:4000 in blocking solution)
for 60 min, cells were washed twice, blocked
again for 60 min, and incubated with anti-mouse
rabbit IgG-HRP (A9044, Sigma Aldrich,
1:20,000 in blocking solution) for 60 min. Cells
were washed thrice before 200 µL substrate
buffer was added (2.8 mM o-phenylenediamine,
35 mM citric acid, 66 mM Na2HPO4, pH 5.0).
Reactions were kept in the dark for 5 to 15 min
and stopped by addition of 1 M H2SO4 (200 µL).
Resulting mixtures were transferred to a 96-well
plate and absorption was determined with
the Clariostar microplate reader at 492 nm.
Data were normalized using cells transfected
with only α2AAR (100%) and mock pcDNA3.1
(0%), respectively. n = 4 independent experiments
were performed with each condition in
triplicate.

The sensitivity of selected ligands to the 
receptor mutants 02,AR-D1287°7A, o2,AR- 
D128"? T, oo, AR-D128*?L, oo,AR-S215°"7A, 
Oty, AR-Y409°°A, Go, AR-Y409°°T, oo, AR- 
Y409° °F, oy ,AR-F427"°F, oo, AR-Y431"A, 
and o5,AR-Y431""F was monitored by G pro- 
tein activation as described above, transfecting 
the appropriate receptor together with Goj,- 
RLucll and GB,/Gyo-GFPj9. Data were analyzed 
as ligand-induced changes in BRET compared 
with vehicle (deltaBRET), and normalization 
was done according to the effect of buffer (0%) 
and norepinephrine (100%) with the exception 
of oo,AR-D128?”"A (dexmedetomidine = 100%), 
Oo, AR-D128?"T, and o,AR-D128*"L (‘7075 = 
100%). Similarly, the effect of the a.,AR- 
D128*°7A, oo ,AR-S215°*7A, o.,AR-Y409° "A, 
Oto, AR-Y4.09°°F, 01, AR-F4277 °F, 12, AR- 
Y431"7A, and o2,AR-Y431"“"F mutations on 
arrestin recruitment was evaluated as de- 
scribed above, transfecting the appropriate 
receptor together with CAAX-rGFP, GRK2, and 
B-arrestin-2-RLucll. Data were analyzed as 
ligand-induced changes in BRET compared 
with vehicle (deltaBRET). Three to seven ex- 
periments were done in duplicate. 


IP accumulation assay 


Determination of G protein-mediated signaling
by human α2AAR, murine α2AAR, and human
α2BAR was performed with an IP accumulation
assay (IP-One HTRF, Cisbio, Codolet,
France) according to the manufacturer’s protocol
and in analogy to previously described
protocols (77, 78). In brief, HEK 293T cells were
cotransfected with the cDNA for a receptor
and the hybrid G protein Gαqi (Gαq protein
with the last five amino acids at the C terminus
replaced by the corresponding sequence of Gαi)
(gift from The J. David Gladstone Institutes,
San Francisco, CA), respectively, in a ratio
of 1:2. After 1 day, cells were transferred into
384-well microplates (Greiner) and incubated
for a further 24 hours. On the day of the experiment,
cells were incubated with test compounds
for 90 min (α2AAR) or 120 min (α2BAR),
and accumulation of second messenger was
stopped by adding detection reagents (IP1-d2
conjugate and Anti-IP1 cryptate Tb conjugate).
After 60 min, TR-FRET was monitored with
a Clariostar plate reader. FRET signals were
normalized to buffer (0%) and the maximum
effect of norepinephrine (100%). Three to nine
(murine α2AAR, α2BAR) or 4 to 11 repeats (human
α2AAR), respectively, were performed for
each test compound, all done in duplicate.


PathHunter arrestin recruitment assay 


Investigation of α2AAR- and α2BAR-stimulated
β-arrestin-2 recruitment was performed with
an assay based on fragment
complementation of β-galactosidase (PathHunter
assay, DiscoverX, Birmingham, UK) as described
(79). In detail, HEK293T cells stably expressing
the enzyme acceptor (EA)-tagged β-arrestin-2
were cotransfected with human α2AAR or α2BAR,
each fused to the ProLink-ARMS2-PK2 fragment
for enzyme complementation, and GRK2
(cDNA Resource Center) at equal amounts
and subsequently transferred into 384-well
microplates (Greiner) after 1 day. After incubation
for a further 24 hours, cells were incubated
with test compounds for 60 min
(α2AAR) or 90 min (α2BAR), arrestin recruitment
was stopped by adding detection reagent,
and the resulting chemiluminescence
was monitored with a Clariostar microplate
reader. Data were normalized relative to
buffer (0%) and the maximum effect of norepinephrine
(100%). Three to nine repeats
for α2AAR (three to six for α2BAR) in duplicate
were measured.


DiscoverX HitHunter cAMP G protein 
activation assay 


Dexmedetomidine, brimonidine, ‘9087, and
‘7075 were tested by DiscoverX (catalog item
86-0007P-2270AG; Eurofins; CA, USA) in their
HitHunter XS+ assay. Freezer stock cAMP
Hunter cell lines were expanded, then seeded
in a total volume of 20 µL into white-walled,
384-well microplates and incubated at 37°C
before testing. For agonist determination, cells
were incubated with compound samples in the
presence of EC80 forskolin to induce response.
Media was aspirated from cells and replaced
with 15 µL 2:1 HBSS/10 mM Hepes:cAMP XS+
Ab reagent. Intermediate dilution of sample
stocks was performed to generate 4X sample
in assay buffer containing 4X EC80 forskolin.
5 µL of 4X sample was added to cells and incubated
at 37°C or room temperature for 30
to 60 min. Final assay vehicle concentration
was 1%. After sample incubation, assay signal
was generated through incubation with 20 µL
cAMP XS+ ED/CL lysis cocktail for 1 hour
followed by incubation with 20 µL cAMP XS+
EA reagent for 3 hours at room temperature.
Microplates were read after signal generation
with a PerkinElmer Envision instrument for
chemiluminescent signal detection. Compound
activity was analyzed using the CBIS data analysis
suite. For Gi agonist mode, percentage activity
was calculated using the following formula:
percentage activity = 100% × [1 − (mean RLU
of test sample − mean RLU of MAX control) /
(mean RLU of vehicle control − mean RLU of
MAX control)]. Brimonidine was used as the
control agonist. Each measurement was done
in duplicate.
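
The percentage-activity formula above translates directly into code. The RLU readings below are hypothetical duplicates used only to show the arithmetic; the function name is illustrative and is not part of the CBIS suite.

```python
from statistics import mean
from typing import Sequence

def gi_percent_activity(test_rlu: Sequence[float],
                        vehicle_rlu: Sequence[float],
                        max_rlu: Sequence[float]) -> float:
    """Percentage activity in Gi agonist mode, following the formula quoted in the text."""
    test, vehicle, max_ctrl = mean(test_rlu), mean(vehicle_rlu), mean(max_rlu)
    return 100.0 * (1.0 - (test - max_ctrl) / (vehicle - max_ctrl))

# Hypothetical duplicate readings (relative light units).
activity = gi_percent_activity(test_rlu=[600.0, 620.0],
                               vehicle_rlu=[990.0, 1010.0],
                               max_rlu=[195.0, 205.0])
print(f"{activity:.2f}%")
```

A sample that reads like the vehicle control gives 0%, and one that reads like the MAX control gives 100%.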


EMTA coupling panel for α2AAR


The ebBRET-based effector membrane translocation
assay (EMTA) allows detection of each
Gα protein subunit activation. Upon receptor
activation, G protein-effector proteins fused at
their C terminus to Renilla luciferase (RlucII)
translocate from the cytoplasm to the plasma
membrane to selectively bind activated Gα proteins
(p63-RhoGEF-RlucII with the Gq/11 family,
Rap1GAP-RlucII with the Gi/o family, and
PDZ-RhoGEF-RlucII with the G12/13 family), thus
leading to an increase in ebBRET by coming into
close proximity to the plasma membrane-targeted
energy acceptor, Renilla green fluorescent protein
(rGFP-CAAX). Heterologous coexpression
of each Gα subunit allows identification of
which specific members of each G protein
family (i.e., Gi1, Gi2, Gi3, GoA, GoB, Gz, Gq,
G11, G14, G15, G12, and G13) are activated by a
receptor. The assay is also sensitive enough
to detect responses elicited by endogenous
Gi/o protein families in the absence of heterologously
expressed G protein. The same
plasma membrane translocation principle
is used to measure β-arrestin-1 or -2 recruitment
(39) using β-arrestin-RlucII/rGFP-CAAX
biosensors.


Cell culture 


The HEK293 clonal cell line (HEK293SL cells),
hereafter referred to as HEK293 cells, was a gift
from S. Laporte (McGill University, Montreal,
Quebec, Canada) and has been described previously
(39). Cells were cultured in DMEM medium
(Wisent; St-Jean-Baptiste, QC, Canada) supplemented
with 10% iron-fortified newborn calf serum
(NCS) (Wisent). Cells were passaged
weekly, incubated at 37°C in a humidified
atmosphere with 5% CO2, and checked for
mycoplasma contamination.


Transfection 


HEK293 cells (1.2 mL at 3.5 × 10^5 cells per mL)
were transfected with a fixed final amount of
premixed biosensor-encoding DNA (0.57 µg,
adjusted with salmon sperm DNA; Invitrogen)
and human α2AAR DNA for Gs, Gi/o, Gq/11, and
β-arrestin experiments. For G12/13 experiments,
cells were transfected with 1 µg of total DNA
(adjusted with salmon sperm DNA; Invitrogen),
including empty pCDNA3.1 vector or human
α2AAR DNA. Transfections were performed
using linear polyethylenimine (PEI, 1 mg/mL;
Polysciences) diluted in NaCl (150 mM, pH 7.0)
as a transfecting agent (3:1 PEI/DNA ratio).
Cells were immediately seeded (3.5 × 10^4
cells per well) into 96-well white microplates
(PerkinElmer) and maintained in culture for the
next 48 hours, after which BRET experiments were
carried out. ebBRET (38) was used to monitor the
activation of each Gα protein, as well as β-arrestin-1
and -2 recruitment to the plasma membrane.
Gαs protein engagement was measured between
the plasma membrane marker rGFP-CAAX
and human Gαs-RlucII in the presence of
human Gβ1, Gγ9, and α2AAR. Gα12 or Gα13 protein
family activation was assessed using the
selective-G12/13 effector PDZ-RhoGEF-RlucII
and rGFP-CAAX coexpressed with Gβ1, Gγ1,
and either Gα12 or Gα13, in the presence of α2AAR.
Gαi/o protein family activation was followed
using the selective-Gi/o effector Rap1GAP-RlucII
and rGFP-CAAX along with the human
Gαi1, Gαi2, GαoA, GαoB, or Gαz subunits and
α2AAR. Gαq/11 protein family activation was
determined using the selective-Gq/11 effector
p63-RhoGEF-RlucII and rGFP-CAAX along
with the human Gαq, Gα11, Gα14, or Gα15 subunits
and α2AAR. β-arrestin recruitment to
the plasma membrane was determined using
a DNA mix containing rGFP-CAAX and β-arrestin-1-RlucII
or β-arrestin-2-RlucII in the presence
of α2AAR.


Bioluminescence resonance energy 
transfer measurement 


On the day of the experiment, cells were washed
with phosphate-buffered saline (PBS) and incubated
in Tyrode Hepes buffer (137 mM NaCl,
0.9 mM KCl, 1 mM MgCl2, 11.9 mM NaHCO3,
3.6 mM NaH2PO4, 25 mM HEPES, 5.5 mM
D-glucose, and 1 mM CaCl2, pH 7.4) for 1 hour
at 37°C. Cells were then treated with increasing
concentrations of compounds for 10 min at
37°C. The luciferase substrate Prolume purple
(1 µM, NanoLight Technologies) was added
during the last 6 min before the reading. Plates
were read on the TriStar² LB 942 Multimode
Microplate Reader (Berthold Technologies)
with the energy donor filter (410 ± 80 nm;
RlucII) and energy acceptor filter (515 ± 40 nm;
GFP10 and rGFP-CAAX). BRET signal (BRET²)
was determined by calculating the ratio of the
light intensity emitted by the acceptor (515 nm)
over the light intensity emitted by the donor
(410 nm), and data were normalized as a percentage
of the maximal response elicited by
the reference compound norepinephrine. The
data were analyzed in GraphPad Prism 9.1
using “dose-response-stimulation log(agonist)
versus response (four parameters)” and data
were presented as means ± SEMs of at least
three different experiments, each done in simplicate.
Emax and EC50 values were determined
from dose-response curves to calculate the log
(Emax/EC50) value for each pathway and each
compound. To determine the relative efficacy
of the compounds to activate the different
signaling pathways, the difference between
the log(Emax/EC50) values was calculated using
the following equation


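$$
\Delta\log\left(\frac{E_{\max}}{EC_{50}}\right) = \log\left(\frac{E_{\max}}{EC_{50}}\right)_{\text{compound}} - \log\left(\frac{E_{\max}}{EC_{50}}\right)_{\text{norepinephrine}} \tag{1}
$$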


The compounds’ efficacy toward each pathway,
relative to norepinephrine, was calculated
as the inverse logarithm of the Δlog
(Emax/EC50) using the following equation

$$
\text{relative efficacy (RE)} = 10^{\Delta\log\left(\frac{E_{\max}}{EC_{50}}\right)} \tag{2}
$$


The SEM was calculated for the log(Emax/EC50)
ratios using the following equation

$$
\text{SEM} = \frac{\sigma}{\sqrt{n}} \tag{3}
$$

where σ is the standard deviation, and n is
the number of experiments.

The SEM was calculated for the Δlog(Emax/EC50)
ratios using the following equation

$$
\text{SEM} = \sqrt{\left(\text{SEM}_{\text{compound}}\right)^{2} + \left(\text{SEM}_{\text{norepinephrine}}\right)^{2}} \tag{4}
$$


Statistical analysis was performed using a two-tailed
unpaired t test on the Δlog(Emax/EC50)
ratios to make pairwise comparisons between
tested compounds and norepinephrine for a
given pathway, where P < 0.05 was considered
statistically significant.
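
The relative-efficacy calculation and its error propagation can be illustrated numerically. The sketch below assumes that σ is the sample standard deviation, which is not stated here; the Emax, EC50, and SEM values are hypothetical, and the function names are illustrative.

```python
import math
from statistics import stdev
from typing import Sequence

def log_emax_ec50(emax: float, ec50: float) -> float:
    """log10(Emax/EC50) for one pathway and one compound."""
    return math.log10(emax / ec50)

def sem(values: Sequence[float]) -> float:
    """Standard error of the mean; sample standard deviation is an assumption."""
    return stdev(values) / math.sqrt(len(values))

def relative_efficacy(log_compound: float, log_reference: float) -> float:
    """Inverse logarithm of the difference between compound and reference."""
    return 10 ** (log_compound - log_reference)

def sem_delta(sem_compound: float, sem_reference: float) -> float:
    """SEM of the difference, combining the two SEMs in quadrature."""
    return math.sqrt(sem_compound ** 2 + sem_reference ** 2)

# Hypothetical values: equal Emax, compound tenfold less potent than the reference.
compound = log_emax_ec50(emax=100.0, ec50=1e-8)
reference = log_emax_ec50(emax=100.0, ec50=1e-9)
rel_eff = relative_efficacy(compound, reference)
print(f"delta log = {compound - reference:.2f}, RE = {rel_eff:.2f}")
print(f"SEM of delta log = {sem_delta(0.3, 0.4):.2f}")
```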


Internalization assay with rGFP-CAAX and 
rGFP-FYVE biosensors 
Plasmids 


The human α2AAR sequence was fused to RlucII
by cloning between the NheI and BamHI sites
of the pCDNA3.1/Zeo(+)-RlucII vector, using PCR
(Q5 Hot Start High-Fidelity DNA Polymerase
from NEB), enzymatic digestion (NEB), and
ligation (Anza T4 DNA Ligase Master Mix;
Invitrogen).


Transfection 


The protocol used for transfection is the same
as for G12/13 EMTA experiments [i.e., cells were
transfected with 1 µg of total DNA (adjusted
with salmon sperm DNA; Invitrogen)]. Transfections
were performed using linear polyethylenimine
(PEI, 1 mg/mL; Polysciences)
diluted in NaCl (150 mM, pH 7.0) as a transfecting
agent (3:1 PEI/DNA ratio). Cells were
immediately seeded (3.5 × 10^4 cells per well)
into 96-well white microplates (PerkinElmer) and
maintained in culture for the next 48 hours,
after which BRET experiments were carried out. Human
α2AAR internalization was evaluated by measuring
the disappearance of hα2AAR-RlucII
from the plasma membrane labeled with rGFP-CAAX
and its relocalization to endosomes labeled
with rGFP-FYVE (39).


Bioluminescence resonance energy 
transfer measurement 


On the day of the experiment, cells were washed
with phosphate-buffered saline (PBS) and incubated
in Tyrode Hepes buffer for 1 hour at
37°C. Cells were incubated for 6 min with
the luciferase substrate Prolume purple (1 µM,


Fink et al., Science 377, eabn7065 (2022) 


NanoLight Technologies) before addition of
the indicated compound (0 or 100 µM), and
kinetics were recorded for 30 min. For
concentration-response curves, BRET signal
was measured after 30 min of incubation. Plates
were read on a Spark microplate reader (Tecan;
Männedorf, Switzerland) using the BRET2
manufacturer settings. BRET signal (BRET²)
was determined by calculating the ratio of
the light intensity emitted by the acceptor
(515 nm) over the light intensity emitted by
the donor (410 nm), and for concentration-response
curves, data were normalized as a percentage
of the maximal response elicited by
the reference compound norepinephrine. The
data were analyzed in GraphPad Prism 9.1
using “log(agonist) versus response-Variable
slope (four parameters)” and data were presented
as means ± SEMs of three experiments
performed in triplicate for kinetics or in simplicate
for concentration-response curves.
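
For readers unfamiliar with the four-parameter model named above, the curve has the form Y = Bottom + (Top − Bottom) / (1 + 10^((logEC50 − X) × HillSlope)), where X is the logarithm of the agonist concentration. The sketch below only evaluates this function with hypothetical parameters; it does not perform the nonlinear fit that Prism carries out.

```python
def four_param_logistic(log_conc: float, bottom: float, top: float,
                        log_ec50: float, hill_slope: float) -> float:
    """Response predicted by the four-parameter (variable slope) logistic model."""
    return bottom + (top - bottom) / (1.0 + 10 ** ((log_ec50 - log_conc) * hill_slope))

# Hypothetical parameters: 0 to 100% response, EC50 of 100 nM, Hill slope of 1.
params = dict(bottom=0.0, top=100.0, log_ec50=-7.0, hill_slope=1.0)

for log_c in (-9.0, -7.0, -5.0):
    print(log_c, round(four_param_logistic(log_c, **params), 2))
```

At a concentration equal to the EC50 the response sits halfway between Bottom and Top, which is how the EC50 values used in the Δlog(Emax/EC50) analysis are defined.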


Cryo-EM sample preparation and 

structure determination 

Preparation of the ‘9087-α2AAR-GoA-scFv16
and ‘4622-α2AAR-GoA-scFv16 complexes


The human WT α2AAR was cloned into the pFastBac
vector with an N-terminal FLAG tag and a
C-terminal histidine tag. This construct was
expressed in Sf9 insect cells using the pFastBac
baculovirus system (Expression Systems). Cells
were infected at a density of 4 × 10^6 cells per
mL and then incubated for 48 hours at 27°C.
Receptor was extracted and purified following
the protocol described previously for α2BAR
(9). Briefly, receptor was purified by Ni-NTA
chromatography, Flag affinity chromatography,
and size exclusion chromatography in
the presence of 100 µM ‘9087 or ‘4622. The
monomeric peak fractions of receptor were
collected and concentrated to ~20 mg/mL.
The freshly purified ‘9087-bound or ‘4622-bound
α2AAR was used for complex formation
with the G protein. GoA heterotrimers
were expressed and purified as previously
described with minor modifications (78).
Briefly, Hi5 cells were grown to a density of 
3 million per mL and then infected with 
Go, and Gy baculovirus at a ratio of 10 to 
20 mL/L and 1 to 2 mL/L, respectively, and 
then incubated for 48 hours at 27°C. Cells 
were solubilized with 1% (w/v) sodium cholate
and 0.05% (w/v) DDM. After centrifugation,
the supernatant was loaded onto a Ni-NTA column
and then exchanged to 0.05% DDM. The
eluted GoA heterotrimer was dephosphorylated
by lambda phosphatase (homemade) and further
purified through ion exchange using a
Mono Q 10/100 GL column (GE Healthcare),
and the peak fractions were collected and
flash frozen in liquid nitrogen until use. The
scFv16 (80) protein was expressed in
Sf9 insect cells and purified with a Ni-NTA column
followed by a Superdex 200 Increase 10/300 GL
column (GE Healthcare) with a buffer composed
of 20 mM HEPES, pH 7.5 and 100 mM
NaCl. The monomeric peak fractions of receptor
were collected, concentrated, and
stored at -80°C until use. The complex formation
process is the same as described. Briefly, the
complex of α2AAR with heterotrimeric GoA
was formed in a buffer containing 20 mM
HEPES pH 7.5, 100 mM NaCl, 0.1% DDM,
1 mM MgCl2, 10 µM GDP, and 100 µM ‘9087
or ‘4622. The α2AAR-GoA complex was then
treated with 50 units of apyrase (NEB) on ice
overnight, and exchanged on an anti-Flag
M1 column into a buffer containing 20 mM
HEPES, pH 7.5, 100 mM NaCl, 0.0075% lauryl
maltose neopentyl glycol (MNG, NG310 Anatrace),
0.0025% GDN (GDN101, Anatrace),
0.001% CHS, 100 µM ‘9087 or ‘4622, and 2 mM
CaCl2 in a stepwise manner. After elution by
adding 5 mM EDTA and 0.2 mg/mL Flag peptide,
the complex was concentrated and incubated
with 15× molar excess scFv16 for 1 hour
on ice, then further purified using a Superdex 200
Increase 10/300 GL column (GE Healthcare)
with a running buffer of 20 mM HEPES, pH 7.5,
100 mM NaCl, 0.00075% MNG, 0.00025%
GDN, 0.0001% CHS, and 100 µM ‘9087 or ‘4622.
The monomeric peak fraction of the α2AAR-GoA
complex was collected and concentrated to
~5 mg/mL for cryo-EM.


Cryo-EM data collection, processing, and 
model building 


3 µL of purified complex sample was applied
onto the grid (CryoMatrix nickel titanium
alloy film, R1.2/1.3, Zhenjiang Lehua Electronic
Technology Co., Ltd.) (81), which had been glow
discharged in a Tergeo-EM plasma cleaner, and then
blotted for 3 s with a blotting force of 0 and quickly
plunged into liquid ethane cooled by liquid
nitrogen using a Vitrobot Mark IV (Thermo
Fisher Scientific, USA) at 10°C and with 95%
humidity. Cryo-EM data were collected on a
300 kV Titan Krios Gi3 microscope. The raw
movies were recorded by a Gatan K3 BioQuantum
camera at a magnification of 105,000,
with a corresponding pixel size of 0.85 Å.
Inelastically scattered electrons were excluded
by a GIF Quantum energy filter (Gatan, USA)
using a slit width of 20 eV. The movie stacks
were acquired with a defocus range of −1.0
to −1.6 µm and a total exposure time of 2.5 s
fragmented into 50 frames (0.05 s per frame), with
a dose rate of 22.0 e per pixel per second. The
imaging mode was super resolution with 2×
hardware binning. The semiautomatic data acquisition
was performed using SerialEM (82).
For the ‘9087-α2AAR-GoA-scFv16 complex, raw
movie frames were aligned with MotionCor2 (83)
using a 9 × 7 patch, and the contrast transfer
function (CTF) parameters were estimated using
Gctf and ctf in JSPR (84). Micrographs with consistent
CTF values, including defocus and astigmatism
parameters, were kept for the following
image processing, which retained 3768 micrographs
from 4217 raw movies. Templates for particle
auto-picking were generated by projecting the
3D volume of the norepinephrine-bound α2AAR-GoA
complex (40). The 2,137,146 particles picked
by template picking were subjected to 2D classification
in cryoSPARC (85) and 3D classification
in Relion3.1 (86). The sorted 321,762 particles
were then subjected to homogeneous reconstruction
in cryoSPARC, yielding a 3.57-Å map.
Further 3D ab initio reconstruction reduced
the particle number to 287,431; these particles were
subjected to CTF refinement and nonuniform
refinement after extraction with a larger particle
box size, finally yielding the 3.47-Å map.

For the ‘4622-α2AAR-GoA-scFv16 complex,
6983 raw movies were collected and subjected
to motion correction using MotionCor2 (83).
Contrast transfer function parameters were
estimated by CTFFIND4, implemented in
Relion3.1 (86). A total of 2,593,747 particles were
auto-picked using the templates in RELION3.1
and then subjected to 2D classification using
cryoSPARC. Selected particles with appropriate
2D averages from 2D classification were
further subjected to ab initio reconstruction.
Particles with an appropriate initial model were
selected from ab initio reconstruction, followed by
heterogeneous refinement in cryoSPARC. The
563,506 particles retained were subjected to
nonuniform refinement and local refinement,
yielding a 3.38-Å reconstruction as determined
by gold-standard Fourier shell correlation using
the 0.143 criterion.

The norepinephrine-o.,AR-Go,4 complex 
structure (PDB 7EJO) (40) was used as the 
initial template for model building. The model 
was docked into the cryo-EM density map using 
UCSF Chimera (https://rbvi-ucsf.edu/chimera), 
followed by iterative manual building in Coot 
(87) and real space refinement in Phenix. The 
statistics of the final models were validated by 
Molprobity. Cryo-EM data collection, refine- 
ment, and validation statistics are summarized 
in table S11. The ligand symmetry-corrected
RMSDs between the docked poses and cryo-
EM poses of ‘9087 and ‘4622 were calculated
using the Hungarian algorithm in DOCK6 (88).


pKa determination for ‘9087


The pKa of ‘9087 (2.90 mg, 0.013 mmol) was
determined by potentiometric titration using 
a Metrohm pH Meter 632 equipped with a 
glass electrode (Metrohm 6.0259.100). The com- 
pound was dissolved in 15 mL of 10% meth- 
anol aqueous solution, at an ionic strength of 
I = 0.15 M using KCl. The resulting solution 
was stirred throughout the experiment using 
a magnetic stir bar and a magnetic agitator. 
The compound was titrated with 0.01 M HCl 
(Titrisol) using an automatic burette (Metrohm 
Dosimat Plus 876). The titrant was added to 
the analyte stepwise (0.024 to 2.87 mL). The 
resulting graph for pKa determination is pre-
sented as a plot of pH(t) against t. The pKa
value was then determined using a simplified
Henderson-Hasselbalch equation. The data 
from the titration experiment were evaluated 
with Origin 9.60. 
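
The simplified Henderson-Hasselbalch evaluation can be illustrated with a minimal Python sketch. It assumes a monoprotic base whose degree of titration equals the amount of HCl added divided by the amount of compound. The function names and any pH values passed to them are hypothetical, and this is not the Origin workflow used in the study.

```python
import math


def degree_of_titration(v_titrant_ml, c_titrant_molar, n_base_mmol):
    """Fraction of the base protonated, assuming every added H+ protonates it."""
    # mL * mol/L gives mmol of titrant added.
    return (v_titrant_ml * c_titrant_molar) / n_base_mmol


def pka_from_point(ph, tau):
    """Henderson-Hasselbalch for a monoprotic base: pKa = pH + log10([BH+]/[B])."""
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    return ph + math.log10(tau / (1.0 - tau))


# Equivalence volume for 0.013 mmol of compound titrated with 0.01 M HCl.
v_eq_ml = 0.013 / 0.01
# Degree of titration at half the equivalence volume.
tau_half = degree_of_titration(v_eq_ml / 2, 0.01, 0.013)
```

Under these assumptions the equivalence volume is about 1.3 mL, which lies inside the titrated range of 0.024 to 2.87 mL, and at half-equivalence (tau = 0.5) the logarithmic term vanishes so that pKa equals the measured pH.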


Off-target activity 
GPCRome 


‘9087 (10 μM) was tested for off-target activity
at a panel of 320 nonolfactory GPCRs using the
PRESTO-Tango GPCRome arrestin-recruitment
assay as described (45). Receptors with at least
threefold increased relative luminescence over
the corresponding basal activity were considered
potential positive hits. Screening was performed by the
National Institute of Mental Health Psycho-
active Drug Screening Program (PDSP). Detailed
experimental protocols are available on the
NIMH PDSP website at
https://pdsp.unc.edu/pdspweb/content/PDSP%20Protocols%2011%202013-03-28.pdf.
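
The threefold criterion can be expressed as a small filter. The sketch below is illustrative only: the receptor names and luminescence values are hypothetical placeholders, and the function is not part of the PDSP analysis pipeline.

```python
def potential_hits(luminescence, basal, fold_threshold=3.0):
    """Return receptors whose signal is at least fold_threshold times basal."""
    hits = {}
    for receptor, signal in luminescence.items():
        fold = signal / basal[receptor]
        if fold >= fold_threshold:
            hits[receptor] = fold
    return hits


# Hypothetical relative luminescence values for two placeholder receptors.
example_signal = {"receptor_A": 300.0, "receptor_B": 250.0}
example_basal = {"receptor_A": 100.0, "receptor_B": 100.0}
example_hits = potential_hits(example_signal, example_basal)
```

With these placeholder numbers only `receptor_A` reaches the threefold cutoff. A receptor flagged this way is a candidate only and still requires a full dose-response follow-up.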


D2R activation 


D2R was selected after the GPCRome panel, 
and ‘9087 was retested for full dose-response 
to determine G protein and arrestin recruit- 
ment (see above). 


I2R binding


Top docking compounds (‘9087, ‘2998, ‘4622, 
and ‘0172) were tested for I2R binding, per- 
formed by Eurofins Cerep (France; catalog no. 
81) as described (78). For compound ‘2998, no 
binding was seen in a single point radioligand 
competition experiment tested at 500 nM and 
the compound is not shown. 


μOR competition binding


Equilibrium [3H]diprenorphine competition
and saturation binding were carried out in
membranes prepared from Chinese hamster
ovary (CHO-K1) cells stably expressing the hu-
man μ-opioid receptor (μOR), as previously described
(89-91). Briefly, binding was performed at
25°C for 90 min in the dark. Binding in μOR/
CHO-K1 cells was carried out in a buffer con-
sisting of 50 mM HEPES-base pH 7.4 (pH
adjusted with KOH), 10 mM MgCl2, 0.1 mM
EDTA, and 0.1% (w/v) bovine serum albumin
with membranes containing ~40 μg/mL pro-
tein. After incubation with radioligand (1 pM
to 10 nM for saturation, 500 pM for compe-
tition), drugs (33 μM to 3.3 pM), and/or 20 μM
cold competitor naloxone, the reaction was
rapidly filtered onto GF/B (PerkinElmer no.
1450-521) glass fiber filtermats that had been
equilibrated for 1 hour in binding buffer sup-
plemented with 0.3% (v/v) polyethyleneimine.
The filtermats were washed five times in 
ice-cold 50 mM HEPES-base pH 7.4 using a 
PerkinElmer semiautomated cell harvester 
(PerkinElmer FilterMate Harvester). The fil- 
termats were dried and Meltilex solid scintillant 
(PerkinElmer no. 1450-442) was melted onto 
the mats for 10 min at 60°C. The scintillant
was allowed to resolidify before disintegra- 
tions were quantified with a Wallac MicroBeta 
Scintillation counter using an integration time 
of 1 min. Nonspecific binding, total bind-
ing, the number of receptor binding sites,
and the Kd of the radiotracer were determined
from saturation binding experiments. Protein
concentrations were determined using the
microBCA method with BSA as the standard.
Ki values were calculated by nonlinear re-
gression analysis and application of the Cheng-
Prusoff correction in GraphPad Prism 9.0.
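
For reference, the Cheng-Prusoff correction converts a competition IC50 into Ki using the radioligand concentration and its Kd: Ki = IC50 / (1 + [L]/Kd). The sketch below is a minimal illustration rather than the Prism implementation; the IC50 and Kd values in the example are hypothetical, and only the 500 pM radioligand concentration comes from the protocol above.

```python
def cheng_prusoff_ki(ic50, radioligand_conc, kd):
    """Ki = IC50 / (1 + [L]/Kd); all three inputs must share one concentration unit."""
    if ic50 <= 0 or kd <= 0 or radioligand_conc < 0:
        raise ValueError("ic50 and kd must be positive; radioligand_conc non-negative")
    return ic50 / (1.0 + radioligand_conc / kd)


# 500 pM radioligand expressed in nM; the IC50 and Kd below are hypothetical.
radioligand_nm = 0.5
example_ki_nm = cheng_prusoff_ki(ic50=100.0, radioligand_conc=radioligand_nm, kd=0.5)
```

When the radioligand concentration equals its Kd, as in this placeholder case, the correction halves the IC50.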


hERG inhibition assays 


‘9087 was tested for hERG inhibition in the
FluxOR assay as described (92). hERG exper-
iments were performed by the National Institute of Mental
Health (NIMH) Psychoactive Drug Screening
Program (PDSP). Experimental protocols are
available on the NIMH PDSP website at
https://pdsp.unc.edu/pdspweb/content/PDSP%20Protocols%2011%202013-03-28.pdf.


Metabolic stability studies 


Metabolic stability of the test compounds was 
assessed using male pooled rat liver micro- 
somes (Sprague Dawley, Sigma Aldrich) as 
previously described (93, 94). The reactions 
were carried out in 2 mL polyethylene tubes on 
a rotator carousel (Stuart SB3) in an incubator 
at 37°C. The incubation mixture contained 
‘9087, ‘7075, PS75, or the positive controls
rotigotine or imipramine (final concentra-
tion 20 μM), and pooled rat liver microsomes
(0.25 mg protein per tube) in Tris-MgCl2 buffer
(50 mM Tris, 5 mM MgCl2, pH 7.4, final vol-
ume 500 μL). Transformation reactions were
initiated by the addition of 50 μL of cofactor
solution (NADPH, Carl Roth, final concentra-
tion 1 mM). After time intervals of 0, 30, and
60 min for ‘9087, ‘7075, and PS75 or 0, 15, 30,
and 60 min for rotigotine and imipramine,
respectively, 100 μL aliquots of the reaction
mixtures were added to 100 μL ice-cold ace-
tonitrile (containing 10 μM chlorpromazine as
internal standard) to terminate enzymatic reac-
tions. Precipitated protein was removed by cen-
trifugation (1 min, 16,000 relative centrifugal
force) and the supernatants were analyzed by
high-performance liquid chromatography-
mass spectrometry (HPLC/MS) on a Thermo
Scientific Dionex Ultimate 3000 HPLC sys-
tem equipped with a Zorbax Eclipse XDB-C8
column (4.6 × 150 mm, 5 μm), a DAD detector
(210 nm, 230 nm, 254 nm, 310 nm), and a
BRUKER amaZon SL mass spectrometer with
ESI source. The following binary eluent system
(methanol in water + 0.1% (v/v) formic acid)
was used: 10% for 1 min, 10 to 100% in 20 min,
100% for 5 min, 100 to 10% in 2 min, 10% for
2 min, flow 0.4 mL/min. Per compound, four
(rotigotine, imipramine) or five (‘9087, ‘7075, PS75)
independent experiments were performed.
Control experiments were conducted in the
absence of cofactor solution to determine non-
specific binding to matrix. The integral (AUC)
of the extracted ion chromatograms (EIC)
was used to analyze the concentration of the
remaining substrates. Concentrations were
plotted in their logarithmic form as a func-
tion of the incubation time (in minutes) to
calculate the elimination rate constant (k) and
to determine the half-life (t1/2) and intrinsic
hepatic clearance (CLint) for each compound
with the following equations (95)


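$$t_{1/2}\,[\mathrm{min}] = \frac{\ln(2)}{k} \qquad (5)$$

$$CL_{\mathrm{int}}\left[\frac{\mu\mathrm{L}}{\mathrm{min} \times \mathrm{mg(protein)}}\right] = \frac{\ln(2)}{t_{1/2}} \times \frac{V\,(\text{of incubation in } \mu\mathrm{L})}{m\,(\text{of protein in incubation in mg})} \qquad (6)$$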
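
As a worked illustration of Eqs. 5 and 6, the following Python sketch estimates the elimination rate constant from a log-linear least-squares fit and then derives the half-life and intrinsic clearance. It is a sketch under stated assumptions, not the evaluation used in the study: the concentration series is synthetic, and the incubation volume (taken here as 500 μL) and protein amount (0.25 mg) are passed in as parameters.

```python
import math


def elimination_rate_constant(times_min, concentrations):
    """Least-squares slope of ln(concentration) versus time, returned as k = -slope."""
    logs = [math.log(c) for c in concentrations]
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_min, logs))
    return -sxy / sxx


def half_life_min(k):
    """Eq. 5: half-life in minutes from the elimination rate constant (1/min)."""
    return math.log(2) / k


def intrinsic_clearance(t_half_min, volume_ul, protein_mg):
    """Eq. 6: intrinsic clearance in uL per minute per mg of protein."""
    return math.log(2) / t_half_min * volume_ul / protein_mg


# Synthetic first-order decay with k = 0.02 per minute (not measured data).
example_times = [0.0, 30.0, 60.0]
example_concs = [20.0 * math.exp(-0.02 * t) for t in example_times]
example_k = elimination_rate_constant(example_times, example_concs)
example_t_half = half_life_min(example_k)
example_clint = intrinsic_clearance(example_t_half, volume_ul=500.0, protein_mg=0.25)
```

Because ln(2)/t1/2 equals k, Eq. 6 reduces to k × V/m; for the synthetic series this gives 0.02 × 500 / 0.25 = 40 μL per minute per mg of protein.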


In vivo methods 
Animals and ethical compliance 


Animal experiments were approved by the 
UCSF Institutional Animal Care and Use Com- 
mittee and were conducted in accordance 
with the NIH Guide for the Care and Use of
Laboratory Animals (protocol no. AN181214).
Adult (8 to 10 weeks old) male C57BL/6 mice
(strain no. 664) were purchased from the
Jackson Laboratory. Mice were housed in cages
on a standard 12:12 hour light-dark cycle with
food and water ad libitum. The α2AAR D79N
mutant mice were purchased from Jackson 
(stock no. 2777), and 7- to 8-week-old females 
were used. Sample sizes were modeled on our 
previous studies and on studies using a similar 
approach, which were able to detect signif- 
icant changes (96, 97). The animals were ran- 
domly assigned to treatment and control 
groups. Animals were initially placed into 
one cage and allowed to freely run for a few 
minutes. Then each animal was randomly 
picked up, injected with compound treatment 
or vehicle, and placed into a separate cylinder 
before the behavioral test. 


In vivo compound preparation 


All ligands were synthesized by Enamine
(‘2998 and ‘7075), by WuXi (‘9087, ‘4622, and
‘0172), or in-house (PS75) and were dissolved 30 min
before testing. Available salt forms were used to
aid solubility: HCl for ‘9087 and ‘7075, TFA for
‘4622, ‘0172, and PS75. ‘9087, ‘4622, and ‘0172
were resuspended in 20% Kolliphor (Sigma-
Aldrich; cat. no. C5135) for s.c. and i.p. injections.
‘2998, ‘7075, and PS75 were resuspended in 20%
cyclodextrin (Sigma-Aldrich; cat. no. H107) for
s.c. and i.p. injections. Atipamezole (Cayman
Chemical Company; cat. no. 9001181) and
dexmedetomidine (Cayman Chemical Company;
cat. no. 15581) were resuspended in NaCl 0.9%
(Teknova; cat. no. S5819) for s.c. and i.p. injec-
tions. ‘9087 was formulated with 40% Captisol
(Carbosynth; cat. no. OC15979) for p.o. dosing.


Behavioral analyses 


For all behavioral tests, the experimenter was 
always blind to treatment. Animals were first 
habituated for 1 hour in Plexiglas cylinders 
and then tested 30 min after s.c. injection of
the α2AAR compounds. The α2AAR antagonist
atipamezole (2 mg/kg, i.p.) was injected 15 min
before s.c. injection of the α2AAR agonists. The
mechanical (von Frey), thermal (Hargreaves, hot 
plate, and tail flick) and ambulatory (rotarod) 
tests were conducted as described previously 
(98). Hindpaw mechanical thresholds were 
determined with von Frey filaments using the 
up-down method (99). Hindpaw thermal sensi- 
tivity was measured with a radiant heat source 
(Hargreaves) or a 55°C hot plate. For the tail 
flick assay, sensitivity was measured by im- 
mersing the tail into a 50°C water bath for both 
WT and D79N mutant mice. For the ambula- 
tory (rotarod) test, mice were first trained on 
an accelerating rotating rod, three times for 
5 min, before testing with any compound. 


SNI model of neuropathic pain 


Under isoflurane anesthesia, two of the three 
branches of the sciatic nerve were ligated and 
transected distally, leaving the sural nerve intact. 
Behavior was tested 7 to 14 days after injury. 


CFA 


The CFA model of chronic inflammation was 
induced as described previously (100). Briefly, 
CFA (Sigma) was diluted 1:1 with saline and 
vortexed for 30 min. Once the CFA was fully suspended,
20 μL was injected into one hindpaw.
Heat thresholds were measured before the 
injection (baseline) and 3 days after using the 
Hargreaves test. 


Constipation assay 


Mice had access to food and water ad libitum 
before the test. On the test day, mice received 
an i.p. injection of a solution (100 μL) con-
taining saline, 10 mg/kg morphine, 30 μg/kg
dexmedetomidine, or 5 mg/kg ‘9087 and were then
individually placed in a clean cage, with no
access to food or water. Fecal pellets were col- 
lected and counted every hour, up to 6 hours. 


Body weight measurement 


The body weights were measured before, 
24 hours after, and 48 hours after mice re-
ceived an i.p. injection of a solution (100 μL)
containing dexmedetomidine (30 μg/kg) or
‘9087 (5, 10, or 20 mg/kg). 


Pharmacokinetics 


Pharmacokinetic experiments were performed 
by Bienta (Enamine Biology Services) in ac- 
cordance with Enamine pharmacokinetic study 
protocols and Institutional Animal Care and
Use Guidelines (protocol number 1-2/2020).
Plasma pharmacokinetics and brain distribu-
tion for ‘9087, ‘2998, ‘4622, ‘7075, and PS75, and
CSF distribution for ‘7075, PS75, ‘9087, and
‘4622, were measured after a 10-mg/kg i.p.
dose. Plasma and brain samples were also
collected for ‘9087 after a 10-mg/kg i.v. and
30-mg/kg p.o. (oral) dose to determine oral
bioavailability. In each compound study, samples
were collected at nine time points (5, 15, 30, 60, 120, 240, 360, 480,
and 1440 min); each time-point
treatment group included three animals.
There was also a control group of one animal.
Male C57BL/6N mice were used in the ‘9087, ‘7075,
and ‘4622 studies, CD-1 mice in the PS75 study,
and male Balb/cAnN mice in the ‘2998 study. For all
compound studies the animals were randomly 
assigned to the treatment groups before the 
pharmacokinetic study; all animals were fasted 
for 4 hours before dosing. For injections, ‘9087
was dissolved in Captisol-water (40%:60%,
w/v), ‘4622 was dissolved in a 20% Kolliphor
HS-physiological saline solution, and ‘7075,
PS75, and ‘2998 were dissolved in a 20%
2-HPβCD aqueous solution. The batches of
working formulations were prepared 10 min 
before the in vivo study. 

Mice were injected i.p. with 2,2,2-tribro-
moethanol at 150 mg/kg before CSF
and blood were drawn. CSF was collected under a stereo-
microscope from the cisterna magna using 1-mL
syringes. Blood was collected from the
orbital sinus in microtainers containing K3EDTA.
Animals were euthanized by cervical disloca-
tion after blood sample collection. Blood
samples were centrifuged for 10 min at 3000 rpm.
Brain samples (right lobe) were weighed and
transferred into 1.5 mL tubes. All samples were
immediately processed, flash-frozen, and stored
at -70°C until subsequent analysis.

Plasma samples (40 μL) were mixed with
200 μL of internal standard (IS) solution. After
mixing by pipetting and centrifuging for 4 min
at 6000 rpm, 4 μL of each supernatant was in-
jected into the liquid chromatography-tandem
mass spectrometry (LC-MS/MS) system. Sol-
utions of internal standards were used to
quantify compounds in the plasma samples.
Brain samples (weight 200 ± 1 mg) were ho-
mogenized with 800 μL of an internal stock
solution using zirconium oxide beads (115 ±
5 mg) in a Bullet Blender homogenizer for 30 s
at speed 8. After this, the samples were cen-
trifuged for 4 min at 14,000 rpm, and the super-
natant was injected into the LC-MS/MS system.
CSF samples (2 μL) were mixed with 40 μL
of an internal stock solution. After mixing
by pipetting and centrifuging for 4 min at
6000 rpm, 5 μL of each supernatant was in-
jected into the LC-MS/MS system.

Analyses of plasma, brain, and CSF samples 
were conducted at Enamine/Bienta. The con-
centrations of compounds in plasma and brain
samples were determined using HPLC-MS/MS.
Data acquisition and system control were per- 
formed using Analyst 1.5.2 software (AB Sciex, 
Canada). The concentrations of the test com- 
pound below the lower limit of quantitation 
(LLOQ = 10 ng/mL for plasma, 20 ng/g for 
brain, and 5 ng/mL for CSF samples) were 
designated as zero. Pharmacokinetic data anal- 
ysis was performed using noncompartmental, 
bolus injection or extravascular input analysis 
models in WinNonlin 5.2 (PharSight). Data 
below LLOQ were presented as missing to
improve the validity of t1/2 calculations.
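
The core quantities of a noncompartmental analysis can be sketched in a few lines of Python. This is an illustration only, not the WinNonlin implementation: the linear trapezoidal rule is an assumption (the integration method used in the study is not stated), the function names are hypothetical, and the numbers in the example are placeholders rather than study data.

```python
def apply_lloq(concentrations, lloq):
    """Set values below the lower limit of quantitation to zero."""
    return [c if c >= lloq else 0.0 for c in concentrations]


def auc_linear_trapezoid(times, concentrations):
    """Area under the concentration-time curve by the linear trapezoidal rule."""
    points = list(zip(times, concentrations))
    return sum((t1 - t0) * (c0 + c1) / 2.0
               for (t0, c0), (t1, c1) in zip(points, points[1:]))


def oral_bioavailability(auc_po, dose_po, auc_iv, dose_iv):
    """Dose-normalized ratio of oral to intravenous exposure, as a fraction."""
    return (auc_po / dose_po) / (auc_iv / dose_iv)


# Placeholder profile: a triangle with unit time steps, peak 2, area 2.
example_auc = auc_linear_trapezoid([0.0, 1.0, 2.0], [0.0, 2.0, 0.0])
# Placeholder exposures for a 30 mg/kg oral and a 10 mg/kg intravenous dose.
example_f = oral_bioavailability(auc_po=150.0, dose_po=30.0, auc_iv=100.0, dose_iv=10.0)
```

Dose normalization matters here because the oral dose (30 mg/kg) is three times the intravenous dose (10 mg/kg); in the placeholder case a 1.5-fold higher oral AUC corresponds to a bioavailability of 0.5.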

Additional pharmacokinetic experiments 
were performed by Sai Life Sciences (Hyderabad, 
India) in accordance with the Sai Study Proto-
col SAIDMPK/PK-22-04-0340. Brain distri-
bution of dexmedetomidine was measured
after a 30-μg/kg i.p. dose, using normal saline
0.9% as its vehicle. Plasma samples were
also collected for PS75 after 10-mg/kg i.v. and
30-mg/kg p.o. (oral) dosing to determine oral
bioavailability; both doses were formulated in
20% v/v HBCD in saline. Testing was done in
C57BL/6 mice. For PS75, 24 mice were divided
into four groups: nine mice for i.v. dosing of
the compound, three mice for i.v. dosing of the 
vehicle, nine mice for p.o. dosing with the 
compound, and three mice for p.o. vehicle 
dosing; sparse sampling of three mice per time 
point for compound-treated groups and one 
mouse per time point for vehicle groups was 
performed. For dexmedetomidine, 36 mice 
were included and split into two groups: three 
mice per time point for compound dosing, 
and one mouse per time point for vehicle-
only dosing. For PS75, blood samples (60 μL)
were collected under light isoflurane anesthe-
sia (Surgivet) from the retro-orbital plexus of a
set of three mice at 0.083, 0.25, 0.5, 1, 2, 4, 6, 8,
and 24 hours. Immediately after blood collec- 
tion, plasma was harvested by centrifugation 
at 4000 rpm, 10 min at 4°C. For dexmedeto- 
midine, brain samples were collected at the 
same time points indicated above. Animals 
were euthanized at respective time points and 
brain samples were isolated and homogenized 
in ice-cold phosphate buffer saline (pH 7.4). 
Total homogenate volume was three times the 
tissue weight. Samples were stored at -70°C
until bioanalysis. All samples were processed for
analysis by a protein precipitation method and
analyzed with a fit-for-purpose LC-MS/MS meth-
od (LLOQ = 3.61 ng/mL for plasma for PS75,
LLOQ = 0.86 ng/mL for brain for dexmede- 
tomidine). The pharmacokinetic parameters 
were estimated using the noncompartmental 
analysis tool of Phoenix WinNonlin software 
(version 8.0). 


Statistical analyses 


Data from functional experiments of adren-
ergic and D2long receptors were analyzed by ap-
plying the algorithms for four-parameter
nonlinear regression implemented in Prism
8.0 (GraphPad, San Diego, CA) to obtain dose-
response curves representing EC50 and Emax
values. Mean values were derived by summa-
rizing the results from each individual exper-
iment to provide EC50 ± SEM and Emax ± SEM
(or SD where indicated). Additional statistical
analyses for Fig. 4; fig. S4; fig. S16, A and C to
E; and fig. S18 were performed with GraphPad
Prism 9.0 (GraphPad Software Inc., San Diego).
Data reported are means ± SEMs or, in Fig. 4
and fig. S18, single data points with means ±
SEMs. Experiments of the compounds in the
in vivo neuropathic, inflammatory, hot plate,
tail flick, and rotarod models were evaluated
using unpaired two-tailed Student’s t test or
one-way analysis of variance (ANOVA) with 
Dunnett’s multiple comparison post hoc test 
to determine differences between groups. Ex- 
periments for body weight and constipation 
were analyzed with a two-way ANOVA. Details 
of the analyses, including groups compared 
in statistical sets, number of animals per 
group, and P values, can be found in the figure 
legends. 
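
To make the four-parameter model and the summary statistics concrete, the sketch below evaluates a standard four-parameter logistic curve and computes a mean with its standard error. It is a minimal illustration, not the Prism fitting routine: no fitting is performed, the parameter names are assumptions, and the example values are placeholders.

```python
import math


def four_parameter_logistic(conc, bottom, top, ec50, hill):
    """Response at a given concentration for a four-parameter logistic curve."""
    return bottom + (top - bottom) / (1.0 + (ec50 / conc) ** hill)


def mean_and_sem(values):
    """Arithmetic mean and standard error of the mean (sample SD / sqrt(n))."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two values are needed to compute an SEM")
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, math.sqrt(variance / n)


# At conc == ec50 the response is midway between bottom and top.
example_response = four_parameter_logistic(10.0, bottom=0.0, top=100.0, ec50=10.0, hill=1.0)
# Placeholder per-experiment estimates summarized as mean and SEM.
example_mean, example_sem = mean_and_sem([1.0, 2.0, 3.0])
```

The four fitted parameters are the bottom and top plateaus, the EC50, and the Hill slope; Emax corresponds to the top plateau, and per-experiment estimates are then summarized as mean ± SEM as described above.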
INTRODUCTION: Organisms from bacteria to
animals and plants must defend themselves 
against pathogens. Homologous protein mo- 
tifs exist in immune pathways of all organisms. 
One such motif is the TIR domain, named after 
the mammalian immune receptors—Toll-like 
receptors and interleukin-1 receptors—where 
it was first identified. Two properties are 
shared among most TIR domains from all 
organisms: the ability to self-associate and 
enzymatic activity involving the cleavage of 
nicotinamide adenine dinucleotide (oxidized
form) (NAD+). NAD+ is a metabolite with redox
properties that has roles in many cellular
processes. In some cases, cleavage of NAD+
leads to the production of cyclic adenosine
diphosphate (ADP)-ribose (cADPR) isomers.


RATIONALE: In bacteria, NAD+-cleavage activ-
ity by TIR domain-containing proteins plays
a role in defense signaling, as well as sup- 
pression of host immunity. One correspond- 
ing pathway is termed the Thoeris defense 
system. This signaling pathway protects bac- 
teria against phage infection and involves the 
thsA and thsB genes. Upon phage infection, 
[Figure: 2'cADPR. Panel labels: Functions in bacteria and plant immunity? Activation of the Thoeris antiphage defense system in bacteria. Suppression of plant immunity.]


Diverse immune roles of bacterial cADPR isomers. Bacteria have TIR domain-containing proteins that cleave 
NAD+ to produce cyclic ADPR isomers with different cyclic linkages. One of these molecules, 3'cADPR, has roles in
diverse immunity pathways. It acts as an activator of the Thoeris antiphage defense system by binding to the protein 
ThsA. When produced by the effector HopAM1 from the plant pathogen P. syringae, it suppresses plant immunity. 
ThsB (a TIR-domain protein) cleaves NAD+
and produces a cADPR isomer, which activ- 
ates ThsA-mediated killing of the infected cell, 
thus protecting the bacterial population. An- 
other bacterial protein that produces a cADPR 
isomer is HopAM1, the TIR-domain effector 
protein from Pseudomonas syringae DC3000, 
which is involved in suppressing plant immu- 
nity. The chemical structures and mechanisms 
of action of the responsible cADPR isomers 
were unknown before this work. Our aim was 
to determine the chemical structures of cADPR 
isomers, the structural basis of their production 
by bacterial TIR domains, and their mechanism 
of action in Thoeris defense signaling and sup- 
pression of plant immunity. 


RESULTS: Using a combination of methods, 
including nuclear magnetic resonance (NMR), 
mass spectrometry, and crystallography, we 
show that the cADPR isomers are cyclized 
by O-glycosidic bond formation between the 
ribose moieties in ADPR. Structures of TIR 
domains that produce cADPR isomers, as 
determined by crystallography and cryo- 
electron microscopy, reveal conformational 
changes that lead to an active assembly that 
resembles those of Toll-like receptor adaptor 
TIR domains. Mutagenesis reveals a conserved 
tryptophan that is essential for cyclization. 
Using crystallography and biophysical ap- 
proaches, we show that one of the cADPR 
isomers (3’cADPR) is an activator of Thoeris 
ThsA proteins responsible for antiphage de- 
fense, by inducing a change in its tetrameric 
state. We also show that the same cADPR iso- 
mer is a suppressor of plant immunity when 
produced by the effector HopAM1. 


CONCLUSION: Collectively, our results reveal 
the molecular basis of cADPR isomer pro- 
duction. The 2’cADPR and 3’cADPR differ only 
in the location of the O-glycosidic bond be- 
tween the ribose moieties in ADPR. These 
compounds add to the growing list of sig- 
naling molecules identified in immune path- 
ways that involve proteins containing TIR 
domains and may represent intermediates 
in their synthesis or signaling molecules with 
their own distinctive activities. Our results 
establish the 3’cADPR isomer produced by 
bacterial TIR domain-containing proteins as 
an antiviral and plant immunity-suppressing 
signaling molecule. 
Cyclic adenosine diphosphate (ADP)-ribose (cADPR) isomers are signaling molecules produced by
bacterial and plant Toll/interleukin-1 receptor (TIR) domains via nicotinamide adenine dinucleotide
(oxidized form) (NAD+) hydrolysis. We show that v-cADPR (2'cADPR) and v2-cADPR (3'cADPR) isomers
are cyclized by O-glycosidic bond formation between the ribose moieties in ADPR. Structures of
2'cADPR-producing TIR domains reveal conformational changes that lead to an active assembly that
resembles those of Toll-like receptor adaptor TIR domains. Mutagenesis reveals a conserved tryptophan
that is essential for cyclization. We show that 3'cADPR is an activator of ThsA effector proteins from the
bacterial antiphage defense system termed Thoeris and a suppressor of plant immunity when produced by
the effector HopAM1. Collectively, our results reveal the molecular basis of cADPR isomer production and
establish 3'cADPR in bacteria as an antiviral and plant immunity-suppressing signaling molecule.


The ~150-residue Toll/interleukin-1 recep-
tor (TIR) domains are widely distributed
in animals, plants, and bacteria and func-
tion through self-association and homo-
typic interactions with other TIR domains
(2). In plants and animals, these domains are 
predominantly found in proteins with immune 
functions [Toll-like receptors (TLRs), interleukin-1 
receptors (IL-1Rs), and their adaptor proteins] 
(2) and plant nucleotide-binding, leucine-rich 
repeat receptors (NLRs) (3, 4). TIR domains 
form higher-order oligomers and orchestrate 
signal amplification by a mechanism called 
signaling by cooperative assembly formation 
(SCAF) (3, 5-7). 

In bacteria, initial studies suggested that 
TIR domain-containing proteins, such as 
TcpB, TcpC, TirS, PumA, and TcpS, serve as 
virulence factors that inhibit host innate 
immune signaling by molecular mimicry, 
although it is not clear how they enter the 
cell (8). Bacterial TIR domain-containing 
proteins are further implicated in antiphage 
defense systems (9-12). 


Many TIR domains have been found to cleave
nicotinamide adenine dinucleotide (oxidized
form) (NAD+) (13-16). In animals, SARM1 (ster-
ile alpha and Toll/interleukin-1 receptor motif-
containing 1) is a metabolic sensor of axonal
nicotinamide mononucleotide (NMN) and NAD+
levels, and its TIR domain executes programmed
axon death by cleaving NAD+ into nicotinamide
and either adenosine diphosphate (ADP)-ribose
(ADPR) or cyclic ADPR (cADPR) (13, 16, 17). The
TIR domains of plant NLRs (TNLs) similarly
cleave NAD+ into nicotinamide and either ADPR
or a cADPR isomer known as variant cADPR
(v-cADPR) (14), which has a different but un-
known cyclization site compared with canonical
cADPR (18). A conserved catalytic glutamate is
required for NAD+ hydrolysis by SARM1 and
plant TIR domains, and the same glutamate is
required for SARM1 to trigger axon degeneration
and plant TNLs to trigger a localized cell death
in response to infections (13, 14, 16). v-cADPR
is a biomarker of plant TNL enzymatic activity 
but may not be sufficient in and of itself to 
trigger a cell-death response (19). 


Bacterial TIR domains can also cleave NAD+
(10, 15, 20-22), again producing nicotinamide
and ADPR or the distinct cADPR isomers
v-cADPR or v2-cADPR. In bacteria, NAD+ nu-
cleosidase (NADase) activity by TIR domains
is a critical component of STING cyclic dinu-
cleotide sensing (10) and the Thoeris defense
system (9). The Thoeris system protects bacte-
ria against phage infection and consists of two
genes: thsA and thsB (the latter can be present
in one or multiple copies). The thsA gene encodes
a protein that contains either an N-terminal
SIR2-like (silent information regulator 2) do-
main and a C-terminal SLOG-like domain (from
the superfamily of ribonuclease-like proteins
named after SMF/DprA and LOG proteins) or a
multitransmembrane N-terminal domain with
a C-terminal macro domain. SIR2 domains
hydrolyze NAD+, whereas SLOG domains can
bind to NAD+-derived metabolites and have
been suggested as sensors for nucleotide-related
ligands (9, 23). The thsB gene encodes a TIR
domain-containing protein (ThsB) (9, 22-24).
Upon phage infection, ThsB cleaves NAD+ and
produces a cADPR isomer, which activates
ThsA (22), causing further NAD+ depletion and
cell death. HopAM1, a bacterial TIR-domain ef- 
fector protein from Pseudomonas syringae 
DC3000, suppresses plant immunity by produc- 
ing v2-cADPR (27). 

TIR domain self-association is required for 
the NADase activity of SARM1 as well as plant 
TNLs (13, 16). Crystal and cryo-electron mi-
croscopy (cryo-EM) structures of SARM1 in its
active conformation revealed the active site 
spanning two TIR-domain molecules, explain- 
ing the requirement for self-association (25). 
The architectures of oligomeric assemblies 
formed by plant and SARM1 TIR domains 
are similar (“enzyme assemblies”) but are dis- 
tinct from those formed by animal TIR domains 
involved in TLR signaling (“scaffold assem- 
blies”) [reviewed in (3)]. Both types of assem- 
blies feature open-ended complexes with two 
strands of TIR domains in a head-to-tail ar- 
rangement but differ in the orientation of the 
two strands (antiparallel in enzyme assemblies, 
parallel in scaffold assemblies). Some plant 
TIR domains have also been shown to act 
as 2',3'-cAMP and 2',3'-cGMP synthetases, by
hydrolyzing RNA and DNA, which requires a
different TIR-domain oligomeric architecture
(26) (cAMP, cyclic adenosine monophosphate;
cGMP, cyclic guanosine monophosphate). A 
limited number of bacterial TIR domains 
have been characterized structurally and 
functionally (23, 27-30). In this study, we 
characterized the cyclization site of cADPR 
isomers, the molecular basis of their pro- 
duction by TIR domains, and their func- 
tions in immunity pathways in bacteria and 
plants. 


O-glycosidic linkage between ribose sugars 
in cADPR isomers 


To identify the sites of cyclization in cADPR iso-
mers, we expressed and purified the TIR do-
mains of AbTir (from Acinetobacter baumannii;
AbTirTIR) and a protein from Aquimarina
amphilecti (AaTirTIR; fig. S1A and table S1),
which produce v- and v2-cADPR, respec-
tively (15, 31). Comparative genomics analyses
suggest that AbTir has functions distinct
from the Thoeris defense system (fig. S1B).

Real-time nuclear magnetic resonance (NMR)-
based and fluorescence-based NADase assays
confirmed that purified AbTirTIR and AaTirTIR
are enzymatically active and produce cADPR
isomers (Fig. 1A and fig. S1, C and D). AbTirTIR-
produced v-cADPR is identical to the cADPR
isomer produced by TIR domains of the plant
immune receptors L6 and ROQ1 (Fig. 1A).
Both AbTirTIR and AaTirTIR can catalyze base-
exchange reactions with 8-amino-isoquinoline
(fig. S2A). They can also use nicotinamide ade-
nine dinucleotide phosphate (NADP+) as a sub-
strate, but its cleavage only yields nicotinamide
and adenosine diphosphate ribose 2'-phosphate
(ADPPR), indicating that cADPR-isomer pro-
duction is specific to NAD+ cleavage (fig. S2B).
We also tested the activities in the presence of
NAD+, adenosine triphosphate (ATP), and Mg2+
and observed no formation of metabolites re-
ported for plant TIR domains (32, 33) (fig. S1F).

We purified v-cADPR and v2-cADPR from
the reaction mixture, using high-performance
liquid chromatography (HPLC), and deter-
mined their chemical structures using NMR
(Fig. 1B; figs. S1E and S2, C and D; and tables S2
and S3). The heteronuclear multiple bond cor-
relation (HMBC) NMR spectrum of AbTirTIR-
produced v-cADPR shows both H1"-C2' and
H2'-C1" cross-peaks, revealing that AbTirTIR cat-
alyzes the formation of a ribose(1"→2')ribose
O-glycosidic linkage in ADPR (Fig. 1B). By
contrast, the HMBC spectrum of v2-cADPR,
produced by AaTirTIR, shows both H1"-C3' and
H3'-C1" cross-peaks, indicating that AaTirTIR
catalyzes the formation of a ribose(1"→3')ribose
O-glycosidic linkage in ADPR (Fig. 1, B and C).
Because the β configuration of the anomeric
carbon in NAD+ is retained in AbTirTIR and

[Fig. 1 panel titles: 1H-13C HMBC spectrum of 2'cADPR (v-cADPR); 1H-13C HMBC spectrum of 3'cADPR (v2-cADPR)]

Fig. 1. Chemical structures of v- and v2-cADPR. (A) Expansions of 1H NMR
spectra showing NADase reactions by 0.1 μM AbTirTIR (A. baumannii), 100 μM
L6TIR (flax), 26 μM ROQ1TIR (N. benthamiana), 0.5 μM AaTirTIR (A. amphilecti), and
2.5 μM BtTirTIR (B. thetaiotaomicron; table S1). The initial concentration for NAD+ was
500 μM, except for L6 (1 mM). Spectra correspond to 16 hours of incubation
time, except for ROQ1 (incubation time of 64 hours). Selected peaks are labeled,
showing the formation of v-cADPR (v) and v2-cADPR (v2). (B) Expansions of HMBC
spectra showing correlations through glycosidic linkages for 2'cADPR (v-cADPR)
and 3'cADPR (v2-cADPR). ppm, parts per million. (C) Chemical structures
of 2'cADPR, 3'cADPR, cADPR, and ADPR. Important NMR peaks (B) and their
correlated positions in the chemical structure (C) are labeled, showing a
1"-2' linkage for v-cADPR and a 1"-3' linkage for v2-cADPR.

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AaTir™*-catalyzed base-exchange reactions 
(fig. S2A) and the coupling constants of the 
anomeric protons of v-cADPR (J1",2” ~ 5.0 Hz) 
and v2-cADPR (J1",3” ~ 4.4 Hz) are similar to 
those of NAD* (5 to 6 Hz), both cADPR isomers 
are likely to retain the same f configuration. 
v2-cADPR purified from Nicotiana benthamiana 
leaves expressing HopAM1 showed an identi- 
cal chemical structure to that of AaTir™™® (table 
S4). Based on these chemical structures, we 
term these molecules 2‘’cADPR and 3’cADPR, 
highlighting the linkages between ribose rings 
of v-cADPR and v2-cADPR, respectively. 


Ligand-free structures and self-association 
of bacterial TIR domains 


SARM1 and plant TIR domains require self- 
association for their NADase activity (13, 25, 34, 35). 
The NADase activity of AbTir^TIR increases disproportionally
with increasing protein concentrations
and increases in the presence of
molecular crowding agents (fig. S3, A and B). 
Size-exclusion chromatography coupled with 
multiangle light scattering (SEC-MALS) exper- 
iments show that AbTir self-associates in a 
concentration-dependent manner (fig. S3, C 
and D), which requires both the TIR domain 
and the N-terminal coiled coil (CC) domain 
(AbTir®). AbTir™2 shows an initially sup- 
pressed enzymatic activity, suggesting that con- 
formational rearrangements are required for 
efficient NAD+ cleavage (fig. S1C). Taken together,
these results suggest that cADPR isomer- 
producing bacterial TIR domains also require 
self-association for their NADase activity. 

To provide insight into the molecular basis of cADPR isomer production by TIR domains, we determined the crystal structures of AbTir^TIR and a closely related v-cADPR (2′cADPR)-producing TIR domain protein (47% sequence identity) from Bacteroides thetaiotaomicron (residues 156 to 287; BtTir^TIR; Fig. 1, figs. S1A and S2A, and table S1), at 2.16- and 1.42-Å resolution, respectively (fig. S4, A and B, and table S5). The structure of AbTir^TIR closely resembles the structures of BtTir^TIR, PdTir^TIR [Protein Data Bank (PDB) ID 3H16], and the v-cADPR (2′cADPR)-producing TcpB^TIR (PDB ID 4C7M) (27-30), with Cα root mean square distance (RMSD) values of 1.2, 1.8, and 1.9 Å, respectively (fig. S4, B and C). Both AbTir^TIR and BtTir^TIR show less similarity to the cADPR isomer-producing ThsB TIR domain of the Bacillus cereus MSX-D12 Thoeris defense system (BcThsB; PDB ID 6LHY; Cα RMSD values of 4.4 and 2.5 Å for AbTir^TIR and BtTir^TIR, respectively).
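
To make the Cα RMSD figures quoted above concrete, the following minimal sketch computes the root mean square distance between two sets of paired Cα coordinates. It assumes the two structures have already been superposed and that residue pairing is known; the function name `ca_rmsd` and the toy coordinates are hypothetical, and the source does not state which software was used for the structural comparison.

```python
import math

def ca_rmsd(coords_a, coords_b):
    """Root mean square distance between paired, already superposed C-alpha atoms."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared Euclidean distance between the paired atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))

# Toy coordinates in angstroms (illustrative only, not taken from any PDB entry):
# the three atom pairs are displaced by 1.0, 2.0, and 2.0 angstroms.
toy_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
toy_b = [(1.0, 0.0, 0.0), (1.0, 2.0, 0.0), (0.0, 1.0, 2.0)]
print(round(ca_rmsd(toy_a, toy_b), 3))
```

With real structures, the coordinates would first need an optimal rigid-body superposition (for example with the Kabsch algorithm) and a residue correspondence from a structural alignment; both steps are omitted here.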

In TIR domains with NADase activity (from SARM1; plant TNLs RPP1, ROQ1, and RUN1; and oyster TIR-STING; table S1), the conserved glutamate residue essential for NADase activity is localized in a pocket consisting of residues from the βA strand, the AA and BB loops, and the αB and αC helices (10, 13, 25, 34). In AbTir^TIR and BtTir^TIR, the αB and αC helices adopt different conformations and the equivalent glutamate residue (E208 in AbTir; E230 in BtTir) is surface exposed (fig. S4D). A comparison of the different chains in the asymmetric unit of the AbTir^TIR crystal shows that the region around E208 is highly flexible. Crystal packing of AbTir^TIR and BtTir^TIR reveals a common symmetric interface (figs. S4C and S5A), with a large buried surface area (BSA) (AbTir^TIR, 1875 Å²; BtTir^TIR, 1332 Å²), which is also observed in the crystals of PdTir^TIR and TcpB^TIR (fig. S5A and table S1). The structures suggest that they represent an inactive conformation, possibly stabilized by the symmetric dimeric arrangement.


Cryo-EM structure of AbTir^TIR bound to NAD+ mimic

We reasoned that we could capture the active state of AbTir by using base-exchange products of NAD+ hydrolysis, which are more biochemically stable than NAD+ itself and could resist cleavage at high concentrations of the enzyme (25). Indeed, in the presence of 8-amino-isoquinoline adenine dinucleotide (3AD), we could visualize filamentous structures of AbTir^TIR by negative-stain electron microscopy (Fig. 2A). Filaments could be observed in the presence of NAD+, but too few to support structure determination (fig. S6A). We determined the cryo-EM structure of AbTir^TIR:3AD filaments at 2.74-Å resolution (fig. S6 and table S6), which shows the presence of 3AD molecules between monomers of AbTir^TIR and an arrangement of TIR domains different from the enzyme assemblies of SARM1 and plant TIR domains but analogous to the scaffold assemblies of the TLR adaptor proteins MAL and MyD88, each of which contains two parallel strands of TIR domains arranged head to tail (Fig. 2, B and D, and fig. S5B) (3). The intrastrand "BE" interface (BSA of 710 Å²) involves the BB-loop, whereas the interstrand "BCD" interface involves one molecule interacting with two molecules in the parallel strand (BSAs of 1340 and 1410 Å², respectively). Assembly formation is accompanied by conformational changes involving the BB-loop and the αB and αC helices (RMSD of 1.8 Å for 101 Cα atoms; Fig. 2C and movie S1). The BB-loop and αB helix have tilted outward by ~50° to 55°, whereas the αC helix has refolded and includes residues from both the CC and CD loops. Because it is arranged through the analogous BE interface, the AbTir^TIR active site is very similar to that of SARM1^TIR, including the conformation of the 3AD molecule (Fig. 2E). Using site-directed mutagenesis, we verified the importance of residues in both the active-site region and in the inter- and intrastrand interfaces for NADase activity of AbTir^TIR (Fig. 3A, fig. S7, and table S7).


Tryptophan affects ADPR cyclization 


The products of the NADase reaction were also assessed for the AbTir mutants (table S7). The analysis revealed that only the W204A (Trp204→Ala) mutant has reduced production of v-cADPR (2′cADPR) compared with wild-type AbTir, demonstrating the importance of W204 for ADPR cyclization (Fig. 3B, fig. S8A, and table S7). We tested whether the mutation of the equivalent tryptophan residue has a similar effect on ADPR cyclization by other bacterial TIR domain-containing proteins. We analyzed the conservation of W204 in a multiple sequence alignment of 122 TIR domains (15, 16, 31). We observed conservation of aromatic residues at this position among all TIR domains that produced a cyclic ADPR product [cADPR, v-cADPR (2′cADPR), or v2-cADPR (3′cADPR)] in vitro, with a strong preference for tryptophan: W (9 of 12), Y (Tyr) (2 of 12), and F (Phe) (1 of 12) (fig. S8B). This conservation is weaker among TIR domains that either produce noncyclic products (nicotinamide and ADPR) or lack activity altogether in vitro. These observations suggest that a large aromatic residue at this position facilitates the production of a cyclic ADPR product.
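
The kind of per-position tally described above can be sketched as follows. This is an illustrative example only: the helper names and the toy alignment are hypothetical, and the toy sequences are constructed merely to reproduce the reported 9 W, 2 Y, and 1 F split at one column; they are not the sequences analyzed in the study.

```python
from collections import Counter

AROMATIC = {"W", "Y", "F"}  # Trp, Tyr, Phe

def column_counts(aligned_seqs, column):
    """Count residues at a 0-based alignment column, skipping gap characters."""
    return Counter(seq[column] for seq in aligned_seqs if seq[column] != "-")

def aromatic_fraction(counts):
    """Fraction of non-gap residues in the column that are aromatic."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(n for aa, n in counts.items() if aa in AROMATIC) / total

# Hypothetical toy alignment: 12 made-up sequences whose third column
# mirrors the 9 W / 2 Y / 1 F distribution reported in the text.
toy_alignment = ["AKWLE"] * 9 + ["AKYLE"] * 2 + ["AKFLE"]
counts = column_counts(toy_alignment, column=2)
print(dict(counts), aromatic_fraction(counts))
```

Applied to a real alignment, the column index would have to be mapped from the reference residue (W204 in AbTir) to the corresponding alignment column, which depends on where gaps fall in the reference sequence.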

To further identify positions important for 
determining the product specificity of TIR- 
domain NADases, we analyzed 278 TIR-domain 
sequences and confirmed that the position 
corresponding to the tryptophan differs be- 
tween cyclase and noncyclase TIR domains, 
although not among cyclase TIR domains 
that produce different forms of cADPR 
(data S1 to S4). All cyclase TIR domains 
contain an aromatic residue at this position 
(fig. S9, A to E). 

We tested the functional importance of the 
conserved aromatic residue by performing site- 
directed mutagenesis on TIR domains known 
to produce different products (fig. S8, C and D), 
with conservative (W, Y, or F) or disruptive (A) 
mutations. Reaction products were analyzed by 
HPLC (fig. S8, C and D). For all TIR domains 
that produce a cyclic product, the mutants ex- 
hibited a decrease in cyclic product formation 
and an increase in ADPR when compared with 
the wild-type proteins (Fig. 3, C to F). The non- 
conservative alanine mutations typically exhib- 
ited the greatest impact on the relative production 
of a cyclic product versus ADPR; however, this 
trend was not universally true. Mutations in 
the background of an ADPR-producing tem- 
plate had variable impacts on NADase activity 
(fig. S8E), but ADPR always remained the only
catabolite produced. These results are consistent 
with the conclusion that the position equivalent 
to W204 in AbTir is critical for the production 
of cyclic ADPR products. 


3′cADPR (v2-cADPR) activates Thoeris ThsA

Bacterial lysates containing cADPR isomers produced by bacterial ThsB or plant TIR domains have been shown to activate the ThsA NADase (Fig. 4A) of the B. cereus MSX-D12 Thoeris antiphage defense system (22). To test if our purified cADPR isomers can directly activate ThsA, we produced and purified four different ThsA proteins (37 to 46% sequence identity) from B. cereus MSX-D12 (BcThsA), A. baumannii (AbThsA), Enterococcus faecium (EfThsA), and Streptococcus equi (SeThsA) and monitored the NAD+-cleavage activity of each in the absence and presence of ADPR, cADPR, v-cADPR (2′cADPR), and v2-cADPR (3′cADPR) using our NMR-based NADase assay (Fig. 4, B and C, and fig. S10, A and B). AbThsA, BcThsA, and a SIR2 domain-only construct of BcThsA (BcThsA^SIR2) cleave NAD+ (fig. S10, A to C), whereas EfThsA, SeThsA, and SeThsA^SIR2 (Fig. 4, B and C, and fig. S10C) are almost inactive under the same conditions (less than 10% of NAD+ consumed after 40 hours). Neither BcThsA nor AbThsA is further activated by ADPR, cADPR, v-cADPR (2′cADPR), or v2-cADPR (3′cADPR) (fig. S10B), suggesting that they have been produced in a fully activated state. Both autoactive and inactive versions of ligand-free BcThsA have previously been reported (22, 23), suggesting that the inactive states of some ThsA variants are sensitive to the choice of purification method or tag. EfThsA and SeThsA cleave NAD+ in the presence of 500 µM v-cADPR (2′cADPR) or v2-cADPR (3′cADPR) (Fig. 4, B and C). A dose-response is observed with v2-cADPR (3′cADPR) treatment, and both proteins are activated by v2-cADPR (3′cADPR) concentrations as low as 5 µM (Fig. 4C). In comparison, v-cADPR (2′cADPR) only activates these proteins at the highest concentration tested (500 µM). ADPR and cADPR have no effect on the NADase activity of EfThsA and SeThsA (Fig. 4B).

Isothermal titration calorimetry (ITC) measurements showed that v2-cADPR (3′cADPR) binds directly to inactive (EfThsA) and autoactive (AbThsA) forms of ThsA at a ~1:1 molar ratio, with dissociation constant (Kd) values of 59.1 ± 15.8 and 189 ± 1.6 nM, respectively (fig. S10, D and E). No binding was detected for v-cADPR (2′cADPR) using ITC (fig. S10, D and E), and the weaker binding affinity of v-cADPR (2′cADPR) was also corroborated by competition binding assays through saturation-transfer difference (STD) NMR, because v2-cADPR (3′cADPR) reduced v-cADPR (2′cADPR) binding to EfThsA, SeThsA, BcThsA, and AbThsA at an equal concentration (fig. S10F). Taken together, these findings support the model that ThsA is activated by TIR domain-produced cADPR isomers, with a preference for v2-cADPR (3′cADPR) over v-cADPR (2′cADPR).

Fig. 2. Cryo-EM structure of the AbTir^TIR:3AD filament. (A) Negative-stain electron micrograph of AbTir^TIR:3AD filaments. (B) Cartoon representation and electrostatic potential density map of the AbTir^TIR:3AD filament. TIR domain subunits are shown in blue, green, magenta, and orange. 3AD is shown in yellow. (C) Structural superposition of ligand-free (crystal structure, orange) and 3AD-bound AbTir^TIR (cryo-EM structure, cyan) molecules reveals conformational changes in the BB-loop and αB and αC helices upon substrate binding. (D) Comparison of AbTir^TIR:3AD, MAL^TIR (PDB ID 5UZB) and SARM1^TIR:1AD (PDB ID 7NAK) assemblies. (E) Comparison of the active sites in AbTir^TIR and SARM1^TIR (PDB ID 7NAI) reveals similar substrate-binding modes.

Fig. 3. Mutagenesis of bacterial TIR domains. (A) NADase activity of AbTir mutants, using the fluorescence-based assay, with 100 µM εNAD and 100 µM protein. Data are presented as means ± SD (n = 3). (B) Production of 2′cADPR and ADPR by wild-type AbTir^TIR and its W204A mutant, monitored by 1H NMR, using an initial NAD+ concentration of 500 µM. (C to F) Mutations of the position equivalent to AbTir W204 affect the production of cyclic NAD+ catabolites by TIR domains (table S1). The NAD+ catabolite peak areas for wild-type and mutant TIR domain reactions after 1 hour are shown. TIR domain grouping is based on the primary product of the wild-type protein [n = 3 for all groups except where no data (ND) could be collected; data are presented as means ± SD].

Fig. 4. Binding of v2-cADPR (3′cADPR) to ThsA^SLOG. (A) Schematic diagram of ThsA domain organization. Residue numbering corresponds to BcThsA (domain boundaries: BcThsA-SIR2, 1 to 283; SLOG, 284 to 476). AbThsA, EfThsA, and SeThsA have identical domain organizations. (B) Activation of EfThsA (0.5 µM) and SeThsA (10 µM) NADase activity by 500 µM ADPR, cADPR, v-cADPR (2′cADPR), and v2-cADPR (3′cADPR). The initial NAD+ concentration was 500 µM. (C) Activation of EfThsA (0.5 µM) and SeThsA (10 µM) NADase activity by 0.5, 5, 50, and 500 µM v-cADPR (2′cADPR) and v2-cADPR (3′cADPR). The initial NAD+ concentration was 500 µM. (D) Crystal structure of BcThsA^SLOG dimer (cartoon; chains colored in slate and magenta) in complex with v2-cADPR (3′cADPR) (green stick representation). (E) Standard omit mFo-DFc map of v2-cADPR (3′cADPR), contoured at 3.0 σ. (F) Enlarged cutaway of the v2-cADPR (3′cADPR) binding pocket in the BcThsA^SLOG structure. (G) Surface representation of the v2-cADPR (3′cADPR) binding pocket.
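
To relate the ITC dissociation constants for v2-cADPR (3′cADPR) to the ligand concentrations used in the activation assays, the sketch below evaluates a simple single-site occupancy estimate, fraction bound = [L] / ([L] + Kd). This is an illustrative approximation, not an analysis from the source: it assumes one independent binding site per protomer and treats the free ligand concentration as equal to the total ligand concentration (ligand depletion is ignored), and the function name `fraction_bound` is hypothetical. Occupancy of the binding site is also not the same thing as enzymatic activation.

```python
def fraction_bound(ligand_nM, kd_nM):
    """Single-site equilibrium occupancy, assuming free ligand ~ total ligand."""
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("ligand must be >= 0 and Kd must be > 0")
    return ligand_nM / (ligand_nM + kd_nM)

# Kd values (nM) for v2-cADPR (3'cADPR) as quoted in the text.
KD_NM = {"EfThsA": 59.1, "AbThsA": 189.0}

# Ligand concentrations (uM) matching the dose series in Fig. 4C.
for protein, kd in KD_NM.items():
    for ligand_uM in (0.5, 5, 50, 500):
        occupancy = fraction_bound(ligand_uM * 1000.0, kd)
        print(f"{protein}: {ligand_uM:>5} uM ligand -> {occupancy:.3f} bound")
```

Under these assumptions, a ligand concentration equal to Kd gives half occupancy, and concentrations well above Kd approach saturation.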

3′cADPR (v2-cADPR) binds to the ThsA SLOG domain

To provide structural insights into cADPR isomer selectivity by ThsA, we determined the crystal structure of the SLOG domain of BcThsA (BcThsA^SLOG) in complex with v2-cADPR (3′cADPR) at 1.6-Å resolution (Fig. 4D and table S5). Continuous electron density for a cADPR isomer with a ribose(1″→3′)ribose O-glycosidic linkage was observed, confirming the structural configuration assigned by our NMR assays (Fig. 4E). BcThsA^SLOG exists as a stable dimer in solution (fig. S11 and table S8) and forms a symmetric dimer (SLOG dimer) in the crystal, with an identical interface to the dimer observed in the ligand-free BcThsA structure (Fig. 4D and fig. S12A; PDB ID 6LHX) (23). Binding of v2-cADPR (3′cADPR) does not lead to substantial structural rearrangements in either the SLOG domain or the dimer interface (fig. S12A). 3′cADPR (v2-cADPR) binds to a highly conserved pocket adjacent to the symmetric dimer interface, and the adenine bases of the two v2-cADPR (3′cADPR) molecules in the dimer are only separated by 4.5 Å and are bridged by two water molecules (Fig. 4F, fig. S12B, and table S9). The C-2 and C-3 hydroxyls of the distal ribose interact with E403, whereas the diphosphate group is involved in hydrogen-bonding interactions with S290, R371, K388, and the backbone amide and carbonyl of G289 and G399, respectively (S, Ser; R, Arg; K, Lys; G, Gly). The adenine base stacks against the side chains of L326 and Q359, whereas the C-2 hydroxyl of the adenine-linked ribose forms a hydrogen bond with the backbone amide of L326 (Q, Gln). 2′cADPR (v-cADPR), which has a ribose(1″→2′)ribose O-glycosidic linkage, cannot form this latter hydrogen bond, and the adenosine moiety of this cADPR isomer is also likely to encounter steric hindrance with binding-pocket residues (Fig. 4G), explaining the preference for 3′cADPR. Mutational analysis confirmed the role of the structure-defined binding-pocket residues in ThsA activation (fig. S10G).


3’cADPR (v2-cADPR) changes ThsA 
tetramer organization 


AbThsA, BcThsA, EfThsA, and SeThsA exist as tetramers in solution, and activation of SeThsA by v2-cADPR (3′cADPR) does not lead to a change in its oligomeric state (fig. S11 and table S8). However, the inactive SeThsA^SIR2 exists as a dimer in solution, whereas the fully active BcThsA^SIR2 exists as a monomer, suggesting that destabilization of SIR2:SIR2 domain interactions within the tetramer may be required for triggering the ThsA NADase activity (figs. S10C and S11 and table S8). To provide more detailed insight into how ThsA is activated by v2-cADPR (3′cADPR), we determined the ligand-free crystal structure of inactive SeThsA at 3.4-Å resolution (table S5) and compared it with the crystal structures of autoactive BcThsA (PDB ID 6LHX) (23) and our BcThsA^SLOG:v2-cADPR (3′cADPR) complex. Crystal-packing analyses reveal a D2 symmetric tetramer with a core consisting of two SIR2 dimers flanked by SLOG dimers at both sides (Fig. 5A). Both SIR2 dimer interfaces involve residues from the α3, α7, α9, and α10 helices. The crystal structure of autoactive BcThsA (PDB ID 6LHX) (23) has a tetramer with an identical architecture to SeThsA, but there are differences in the SIR2:SIR2 interfaces (Fig. 5, A and B). One of the molecules in the BcThsA SIR2 dimers has undergone a rotation of ~27° and translation of ~14 Å, compared with the SeThsA SIR2 dimers (Fig. 5C), resulting in a decrease of the interface area (2901.8 Å² in SeThsA; 1334.8 Å² in BcThsA). The α3 helix, which is involved in SIR2 dimerization but also covers a part of the predicted active site region in SeThsA (Fig. 5, B and D), adopts a different conformation (SIR2^A and SIR2^D) or is disordered (SIR2^B and SIR2^C) in BcThsA, enabling better access to catalytic residues (N113 and H153 in SeThsA, and N112 and H152 in BcThsA; N, Asn; H, His) (Fig. 5, B and D) (23). Comparison of the SLOG dimers in the inactive SeThsA structure with the dimer in the BcThsA^SLOG:v2-cADPR (3′cADPR) complex reveals that v2-cADPR (3′cADPR) binding is likely to induce changes in the orientation and position of the two SLOG domains (Fig. 5, E and F, and movie S2). In the SeThsA tetramer, this change will most likely cause the SIR2^A and SIR2^B domains and the SIR2^C and SIR2^D domains to move in opposite directions, bringing the α10 helices into closer proximity (Fig. 5G and movie S3), adopting a similar configuration to the SIR2 dimer interface observed in the crystal structure of autoactive BcThsA (Fig. 5B). We predict that this movement of the SIR2 domains is sufficient to destabilize the α3 helix conformation, enabling NAD+ to access the active sites. SeThsA double mutants with reverse-charge substitutions of highly conserved residues in the SIR2 dimer interface (fig. S12C) are either autoactive (E170R, D251R; D, Asp) or not activated by v2-cADPR (3′cADPR) (R166E, R254E), confirming the role of the SIR2 dimer interface in regulating ThsA NADase activity (Fig. 5H). In summary, these findings reveal the structural basis of cADPR isomer selectivity by ThsA and demonstrate that v2-cADPR (3′cADPR) activates the NADase function of ThsA by changing its tetramer organization.
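
As a small worked example of the interface comparison above, the following sketch computes the relative decrease in SIR2:SIR2 interface area from the two values quoted in the text, which comes to about 54%. The helper name `relative_decrease` is hypothetical, and the calculation is simple arithmetic on the reported numbers rather than a reanalysis of the structures.

```python
def relative_decrease(before, after):
    """Fractional decrease going from `before` to `after`."""
    if before <= 0:
        raise ValueError("`before` must be positive")
    return (before - after) / before

# Interface areas in square angstroms, as quoted in the text.
sethsa_area = 2901.8  # inactive SeThsA SIR2 dimer
bcthsa_area = 1334.8  # autoactive BcThsA SIR2 dimer

loss = relative_decrease(sethsa_area, bcthsa_area)
print(f"SIR2:SIR2 interface area decreases by {loss:.0%}")
```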


Immunity suppression by HopAM1 requires 
v2-cADPR (3’cADPR) production 


3′cADPR (v2-cADPR) is also produced by the P. syringae DC3000 effector HopAM1 (27). To examine HopAM1's ability to suppress immunity, we generated transgenic Arabidopsis plants that express HopAM1 and the catalytically null mutant HopAM1^E191A. The transgenic plants were challenged with the immunity-inducing peptide flg22, and the production of reactive oxygen species (ROS), a hallmark of plant immunity, was quantified. Plants induced to express HopAM1 with estradiol, but not HopAM1^E191A or the uninduced plants, showed suppressed ROS production (Fig. 6A). The immunity suppression of HopAM1 correlates with v2-cADPR (3′cADPR) production but not NAD+ depletion (Fig. 6B).

Arabidopsis FRK1 (flg22-induced receptor-like kinase 1) is rapidly induced by multiple microbe-associated molecular patterns, and FRK1 promoter luciferase reporter (FRK1-LUC) lines provide a dynamic readout of pathogen suppression of pattern-triggered immunity (36). Real-time imaging of DC3000 or DC3000ΔhopAM1-1/1-2 challenged FRK1-LUC plants shows faster suppression of luciferase activity in DC3000 compared with the HopAM1 mutant (Fig. 6C). Fv/Fm, the maximum (dark-adapted) quantum efficiency of photosystem II, is a chlorophyll fluorescence parameter that allows nondestructive quantitative and spatial measurements of plant health. Fv/Fm has been shown to be strongly suppressed by virulent DC3000 (36), the extent of which is modulated by effective pattern-triggered immunity (37). Consistent with the reduced suppression of FRK1-LUC luciferase activity in FRK1-LUC plants, suppression of Fv/Fm was reduced after infection with the DC3000ΔhopAM1-1/1-2 mutant (Fig. 6, D and E), and these immunity suppression phenotypes are directly linked to the production of v2-cADPR (3′cADPR; Fig. 6, F to I). These results indicate that v2-cADPR (3′cADPR) is responsible for HopAM1's suppression of plant immunity (a speculative model is shown in fig. S13).

Because HopAM1 acts in plant cells, we also tested its 2′,3′-cAMP and 2′,3′-cGMP synthetase activity, either as a recombinant protein with DNA as a substrate or in Arabidopsis challenged with DC3000 or DC3000ΔhopAM1-1/1-2 (fig. S14). In neither case could we detect any production of 2′,3′-cAMP, cGMP, cyclic cytidine triphosphate (cCTP), or cyclic uridine triphosphate (cUTP).


Discussion 


Bacterial and plant TIR domains produce cyclic signaling nucleotides with immune and virulence functions using NAD+ or nucleic acids as substrates (14, 15, 21, 22, 26, 32, 33, 38-40). Here, we report the chemical structures of two TIR domain-produced cADPR isomers, v-cADPR and v2-cADPR, which reveal that TIR domains can catalyze O-glycosidic bond formation between the ribose sugars in ADPR and that cyclization occurs at the 2′ (v-cADPR; 2′cADPR) and 3′ (v2-cADPR; 3′cADPR) positions of the adenosine ribose. These linkages differ from the canonical cADPR (produced by glycohydrolases such as CD38, Aplysia californica ADP-ribosyl cyclase, and the SARM1 TIR domain), which is cyclized via the N1 position of the adenine ring (16, 18, 41) and do not involve the alternative N positions (N6 and N7) of the adenine ring (15, 21). NAD+-dependent O-glycosidic bond formation between ribose sugars is a new enzymatic activity of TIR domains, but it has been reported for the ADP-ribosyl transferase (ART) domain of poly(ADP-ribose) polymerases (PARPs), which catalyze 2′-1″ and 2″-1″ ribose-ribose bonds between ADPR molecules using NAD+ as a substrate (42, 43).

NADase activity of SARM1 and plant TIR domains requires self-association (13, 14, 25), and our biochemical studies with AbTir suggest that this is also the case for cADPR isomer-producing bacterial TIR domains. The CC domains of both AbTir and TcpB (27) self-associate in solution and may therefore have a similar role to SARM1 SAM domains, plant TNL NB-ARC domains, and bacterial SAVED (44) and STING (45) domains in facilitating TIR-domain clustering.

Fig. 5. 3′cADPR (v2-cADPR) changes ThsA tetramer organization. (A) Structure of SeThsA (left; chains colored in sky blue, yellow, orange, and green) and BcThsA (right; chains colored in slate, red, magenta, and blue) tetramers. SLOG and SIR2 dimers are highlighted by dashed black boxes. (B) Enlarged cutaways of SeThsA (left) and BcThsA (right) SIR2 dimers. Dashed circles represent active site regions with catalytically important residues (N113 and H153 in SeThsA; N112 and H152 in BcThsA) displayed in stick representation (magenta). SIR2 dimers were superimposed using SIR2^A of SeThsA and BcThsA. (C) Comparison of SIR2^B in superimposed SeThsA and BcThsA SIR2 dimers. Movement of BcThsA SIR2^B (salmon) with respect to the SeThsA SIR2^B (yellow) is indicated by the black dashed arrows. (D) Surface representation of ThsA SIR2 domains. The dashed circles indicate the catalytic residues (magenta). (E) SeThsA SLOG dimer. (F) Structural superposition of SeThsA (orange) and BcThsA (slate) SLOG dimers. Movements of BcThsA SLOG domains with respect to the SeThsA SLOG domains are indicated by gray arrows. (G) Predicted model of SeThsA after 3′cADPR binding. Dashed arrows indicate the direction of SeThsA SIR2 domain movements induced by v2-cADPR (3′cADPR) binding to the SLOG domains. (H) NADase activities of 50 µM SeThsA mutants ± 50 µM 3′cADPR. Initial NAD+ concentration was 500 µM.

Crystal structures of cADPR isomer-producing bacterial TIR domains AbTir^TIR, BtTir^TIR, TcpB^TIR (27, 29, 30), and BcThsB (23) do not display interfaces analogous to enzyme and scaffold TIR-domain assemblies (3). Our cryo-EM structure of the filamentous assembly of AbTir^TIR:3AD reveals a scaffold assembly arrangement analogous to MAL and MyD88 (5, 7). Our mutagenesis data confirm that the observed arrangement is important for its catalytic function and that an intact BB loop is required. The active site is similar to the one in SARM1 (25), consistent with it being formed through the analogous BE interface-mediated association. Conformational changes in AbTir^TIR monomers are required to form the filamentous assembly. Future studies will define how additional domains in bacterial TIR proteins facilitate the active configuration and what size active complexes form in bacterial cells. The symmetric interface, found in crystal structures of all bacterial TIR domain-containing proteins with known structure, except for BcThsB (23, 27-30), and shown to facilitate TcpB self-association and modulation of Toll-like receptor signaling (27, 46), may play a regulatory role in the transition to the active state.

Fig. 6. 3′cADPR (v2-cADPR) production and immunity suppression by HopAM1. (A) ROS production of Arabidopsis transgenic plants expressing the P. syringae DC3000 effector HopAM1 or HopAM1^E191A, induced with 10 µM estradiol. ROS production was induced with 1 µM flg22. Data are presented as means ± SE (n = 12; Student's t test, p < 0.01). RLU, luminescence unit. (B) Quantification of NAD+ and 3′cADPR from transgenic leaf samples in (A). Data are presented as means ± SE (n = 3). Transgenic plants expressing HopAM1 (estradiol-induced) produce significantly lower levels of ROS and significantly higher amounts of 3′cADPR, whereas the differences in NAD+ levels are not significant (Student's t test; p < 0.01). mAU, milli-absorbance unit. (C) Suppression of pattern-triggered immunity measured by FRK1-LUC activity after challenge with DC3000 or the DC3000ΔhopAM1-1/1-2 mutant. hpi, hours postinoculation. (D) Representative chlorophyll fluorescence images of Fv/Fm, a nondestructive measurement related to photosynthetic activity that relates to plant health, after challenge with DC3000, the ΔhopAM1-1/1-2 mutant, or the effector-deficient mutant DC3000 D28E (reporting pattern-triggered immunity). (E) The respective quantitative Fv/Fm measurements over the infection time course in (D). Error bars represent SE between three biological replicates of the same challenge, with the ΔhopAM1-1/1-2 mutant being significantly different from DC3000 (Student's t test, p < 0.05 from 18 hpi). The graph is representative of four independent experiments. (F to I) LC-MS/MS analysis at 18 hpi of v2-cADPR (3′cADPR) production in A. thaliana Col-0 challenged with virulent DC3000 (F) or the ΔhopAM1-1/1-2 mutant (G) and the respective spectra when co-injected with v2-cADPR (3′cADPR) standard from AaTir^TIR [(H) and (I)]. The retention time (x axis) is reported in minutes; the y axis reports the relative count per second (with 100% corresponding to the most intense peak). All leaves wel

Manik et al., Science 377, eadc8969 (2022)


A highly conserved tryptophan residue in the αC helical region, part of the previously defined WXXXE motif (47) (where X is any residue), plays a role in the cyclization of ADPR by bacterial TIR domains. The equivalent tryptophan also plays a role in the catalytic activity of SARM1 and plant TIR domains. In SARM1, this tryptophan (W638) mediates aromatic stacking interactions with NAD+ mimetics and the W638A mutation reduces NADase activity (13, 25). In the flax L6 TNL protein, alanine mutation of the equivalent tryptophan (W131) abrogates cell-death signaling, and, in the cryo-EM structure of the related flax L7 TIR domain in complex with DNA, it is involved in interaction with the product 2′,3′-cAMP (26, 48). In the A. californica ADP ribose cyclase, related to CD38 and similar to AbTir and AaTir in that it produces a cyclic ADPR (canonical cADPR) as the major product of NAD+ hydrolysis, a phenylalanine residue (F174) directs the folding of the substrate during the cyclization reaction by interacting with the adenine base of ADPR after nicotinamide cleavage (49, 50). Cyclization of 1″-2′ and 1″-3′ ribose O-glycosidic bonds in ADPR by TIR domains will require the two ribose sugars to come into proximity after nicotinamide cleavage, and a possible role for the conserved tryptophan residue is to facilitate such substrate folding through interaction with the adenine base.

Although multiple plant TNL proteins and 
bacterial TIR proteins produce cADPR isomers, 
their mechanism of action and targets are only 
starting to be resolved. We speculate that the 
ThsB-produced cADPR isomer of the Thoeris 
defense system (22) corresponds to v2-cADPR 
(3'cADPR) because this isomer is a strong ac- 
tivator of ThsA NADase activity and has nano- 
molar affinity for ThsA proteins from different 
bacteria (Fig. 4). 2’cADPR (v-cADPR) can also 
trigger the NADase activity of ThsA [Fig. 4B 
and (22)], but a higher concentration is needed 
to activate ThsA because it shows weaker bind- 
ing than v2-cADPR (3’cADPR). Consistent with 
our biochemical data, the BcThsA(SLOG)-v2-cADPR
(3'cADPR) complex structure reveals that the
binding pocket is selective for v2-cADPR
(3'cADPR), which induces a reorganization of the
ThsA tetramer to allosterically promote binding
to its substrate NAD+. This mode of action
is reminiscent of the NMN-induced activation 
of SARM1, which is only able to bind and cleave 
the substrate NAD+ after a change to its octamer
organization triggered by NMN binding
to its ARM domain (25). 2'cADPR (v-cADPR)
produced by the protein BdTIR from the plant
Brachypodium distachyon has been found to
bind to the protein Thoeris anti-defense 1 (Tad1)
that inhibits Thoeris immunity (51), but there
is no data showing that this is the isomer 
produced by ThsB proteins. Future studies 
will determine the identity of ThsB-produced 
cADPR isomers, the structural basis for 
how they activate ThsA, and whether SLOG 
domains in cytokinin-activating proteins in 
plants (52, 53) are also receptors for cADPR
isomers. 

The nucleotides pRib-AMP and pRib-ADP
[2'-(5''-phosphoribosyl)-5'-adenosine monophosphate
and 2'-(5''-phosphoribosyl)-5'-adenosine
diphosphate] were shown to trigger immune
signaling in plants by allosterically promoting
the EDS1 (enhanced disease susceptibility 1)-
PAD4 (phytoalexin deficient 4) complex to
bind to the plant NLR protein ADR1-L1; their
production requires TIR proteins (32). Plant
TIR domains can also generate ADP-ribosylated
ATP (ADPr-ATP) and di-ADPR, which in turn
promote the association of EDS1 and SAG101
(senescence-associated gene 101) with the helper
NLR NRG1A (N requirement gene 1A) (33).
Multiple TIR domains produce cADPR isomers 
in plants, but the EDS1-binding molecules have 
not yet been detected in planta and their bio- 
synthetic pathways are not defined. The au- 
thors of (32, 33) proposed a pathway for the 
production of pRib-AMP and pRib-ADP as well 
as ADPr-ATP and di-ADPR that does not in- 
volve cADPR isomers. Our cADPR isomer 
structures show that pRib-AMP can be de- 
rived directly from v-cADPR (2’cADPR) by 
cleavage of its pyrophosphate bond, suggesting
that NAD+ could be the substrate in this
case. The cleavage of the pyrophosphate bond 
could indicate the involvement of plant NUDIX 
hydrolases like NUDX6 and 7, which regulate 
plant immunity by degrading 2’,3’-cAMP and 
2',3'-cGMP, the other nucleotides putatively 
produced by TIR domains (26). 

3'-O-β-D-ribofuranosyladenosine, which has an
identical 1''-3' O-glycosidic linkage to v2-cADPR
(3'cADPR) but lacks the phosphate groups,
has been shown to accumulate in leaves in- 
fected with the HopAM1-producing bacterium 
P. syringae DC3000 (54), suggesting that cADPR 
isomers can be further modified in plants. Pyro- 
phosphate bond cleavage of HopAM1-produced 
v2-cADPR (3’cADPR), followed by removal of 
the two ribose-5-phosphate groups, is a possi- 
ble synthetic path for this nucleotide product, 
which suggests that cADPR isomers perhaps 
not only serve as signaling molecules but are 
also important intermediates in the synthesis 
of additional previously uncharacterized nu- 
cleosides associated with plant immunity. In 
conclusion, our study unravels the cyclization 
site of cADPR isomers and informs on their 
production by TIR domains and their signaling 
in immunity pathways in bacteria and plants. 


Methods summary 


A comprehensive description of materials and 
methods is presented in the supplementary 
materials. These descriptions include the details
of cloning of expression plasmids and site-directed
mutagenesis, protein expression and
purification, different assays of NAD+ cleavage
and characterization of its products (fluorescence,
NMR, HPLC, and mass spectrometry-based
approaches), and bioinformatic analysis. For the
determination of chemical structures of cADPR 
isomers, the compounds were produced by 
AbTir, AaTir, and HopAM1 and the structures 
were determined using NMR (assignments 
are provided in tables S2 to S4) and liquid 
chromatography-tandem mass spectrometry 
(LC-MS/MS). Protein:ligand and protein:protein 
interactions were characterized by STD NMR, 
ITC, and SEC-MALS. Three-dimensional struc- 
tures were determined using x-ray crystallogra- 
phy (crystallographic parameters are provided 
in table S5) and cryo-EM (structure determi- 
nation details are provided in fig. S5 and table 
S6). Also described are phytobacterial chal- 
lenge and ROS assays in HopAM1-expressing 
Arabidopsis plants. 


INTRODUCTION: The dorsolateral prefrontal
cortex (dlPFC) lies at the center of high-order
cognition and complex social behaviors, 
which are highly derived traits in anthropoid 
primates—particularly in humans. 


RATIONALE: The granular dlPFC represents 
an evolutionary specialization found only 
in anthropoid primates, and alterations in the 
molecular and cellular mechanisms underlying 
its intricate circuitry have been implicated in 
myriad neuropsychiatric diseases. However, 
little is known about the full repertoire of cell 
types in the primate dlPFC and how conserved
these cell types are between human and other 
primate species. 


Transcriptomic taxonomy of the dlPFC in four
anthropoid primates. (Top left) Homologous regions
of the dlPFC dissected for snRNA-seq and
sn-multiome analyses. (Top right) 114 hierarchically
organized (dendrogram) transcriptomically defined
cell subtypes distributed across the four species
(bar plots; same color code as in the top left
panel), with species-specific variations highlighted.
(Bottom) Notable molecular changes across species
featured by species-specific FOXP2 expression and
the human-specific posttranscriptional switching
between SST and TH.


RESULTS: We generated single-nucleus transcriptome
data profiling more than 600,000
nuclei from the dlPFC of adult humans,
chimpanzees, rhesus macaques, and common
marmosets, thus spanning major primate phy- 
logenetic groups. To study regulatory mecha- 
nisms underlying human-specific divergence, 
we generated single-nucleus multiome data 
(snATAC-seq and snRNA-seq) profiling the 
human dlPFC. Through cell clustering, marker
gene expression, and integration with pub- 
lished multimodal and multispecies data we 
defined three levels of hierarchically organized 
taxonomy of transcriptomically defined neu- 
ronal, glial, and non-neural cell types in the 
four species, including four major cell classes, 
29 subclasses and 114 subtypes. 

Most cell subtypes were conserved across 
the four species but we unraveled prominent 
species differences both at the molecular and 
cellular levels. We identified five cell subtypes 
detected in only a subset of species, including a 
layer (L) 2-3 intratelencephalic subtype absent 
in marmosets, an inhibitory neuron subtype
exclusive to marmosets, and a microglial sub- 
type detected only in humans. Cross-species 
comparisons of cell type proportions showed that 
L2-3 intratelencephalic neurons underwent substantial
expansion in humans compared with
other species as well as in Catarrhini as com- 
pared with marmosets. Gene expression entropy 
analysis revealed more transcriptomic heteroge- 
neity among L2-3 intratelencephalic neurons 
in Catarrhini compared to marmosets. These 
results confirm and extend theories of primate 
cortical expansion. 

Within homologous cell subtypes across 
species, we identified prominent molecular 
changes. These are characterized by loss of 
expression of tyrosine hydroxylase (TH), the 
rate-limiting enzyme in catecholamine (includ- 
ing dopamine) biosynthesis, in the inhibitory 
neurons of chimpanzees that are homologous 
to TH-expressing inhibitory neurons in the 
other species studied. Among TH-expressing 
homologous cell subtypes in humans, maca- 
ques, and marmosets, we identified a human- 
specific posttranscriptional switch between the 
neuropeptide SST and TH, and the human- 
specific expression of genes involved in dopa- 
minergic function. 

Through transcriptomic comparisons across 
the four primate species and immunohisto- 
chemistry across 51 mammal species, we found 
that the neuropsychiatric risk gene FOXP2 ex- 
hibited human-specific expression in microglia 
and primate-specific expression in L4 excit- 
atory neurons. By integrating chromatin 
accessibility and gene coexpression, we iden- 
tified cis-regulatory elements regulating FOXP2 
expression and constructed FOXP2 regulatory 
networks including downstream targets mir- 
roring the cell type- and species-specific FOXP2 
expression patterns. 


CONCLUSION: We produced a transcriptomic 
catalog of the primate dlPFC cell types, complemented
with epigenomic characterization
in the human dlPFC. Our analyses delineated
cell type homology and transcriptomic con- 
servation across species and identified species 
divergence at the molecular and cellular levels, 
as well as potential epigenomic mechanisms 
underlying these differences. Shared and species- 
divergent features were implicated in biological 
pathways and neuropsychiatric diseases. Our 
data may serve as a resource for future studies 
on prefrontal cortex function and disease. 


The granular dorsolateral prefrontal cortex (dlPFC) is an evolutionary specialization of primates that is
centrally involved in cognition. We assessed more than 600,000 single-nucleus transcriptomes from adult
human, chimpanzee, macaque, and marmoset dlPFC. Although most cell subtypes defined transcriptomically
are conserved, we detected several that exist only in a subset of species as well as substantial species-specific
molecular differences across homologous neuronal, glial, and non-neural subtypes. The latter are
exemplified by human-specific switching between expression of the neuropeptide somatostatin and tyrosine
hydroxylase, the rate-limiting enzyme in dopamine production, in certain interneurons. The above molecular
differences are also illustrated by expression of the neuropsychiatric risk gene FOXP2, which is human-specific
in microglia and primate-specific in layer 4 granular neurons. We generated a comprehensive survey
of the dlPFC cellular repertoire and its shared and divergent features in anthropoid primates.


The expansion and evolutionary specializations
of the cerebral cortex, especially
the prefrontal cortex (PFC), are thought
to underlie the rich and complex nature
of cognition in humans and other primates
(1, 2). The PFC is larger and anatomically
more complex in anthropoid primates
(i.e., monkeys and apes) compared with that of
other analyzed mammals, covering the entire
dorsolateral two-thirds of the frontal lobe and
having a well-defined granular layer (L) 4
(1-6). The granular dorsolateral PFC (dlPFC)
exhibits broad connectivity with other brain
regions (1, 2). Dysfunction of the dlPFC has
been implicated in the etiology of many neuropsychiatric
disorders (7, 8). Accordingly, nonhuman
primates have been used as model
species as their complex cognition, expanded
PFC, and genetic makeup best approximate
that of humans (9). Single-cell genomics has
emerged as a central tool in understanding
cell taxonomy and evolution of the brain, but
previous efforts to characterize human PFC
were not extensive in the scope of cell type
classification and nonhuman primate species
selected for comparison (10-15).

To better understand the cellular repertoire 
of the dlPFC and its conserved and divergent
features, we performed single-nucleus RNA
sequencing (snRNA-seq) of the dlPFC in
adult humans (Homo sapiens), chimpanzees
(Pan troglodytes), rhesus macaques (Macaca
mulatta), and common marmosets (Callithrix
jacchus), representing major anthropoid
phylogenetic groups: Hominini (here represented
by humans and chimpanzees),
Catarrhini (humans, chimpanzees, and macaques),
and Platyrrhini (marmosets). The inclusion
of chimpanzees, one of the closest
living relatives of humans, as well as two
outgroup species enabled robust identification
of human-specific features. The computational
analysis of snRNA-seq data revealed
diverse neuronal, glial, and non-neural transcriptomically
defined cell types (hereafter
referred to as subtypes) and uncovered multiple
conserved and divergent features, including
those found exclusively in humans. To
delineate regulatory mechanisms underlying
human-specific changes, we conducted single-nucleus
multiome (sn-multiome) analysis characterizing
chromatin accessibility [ATAC sequencing
(ATAC-seq)] and gene expression (RNA-seq)
simultaneously on adult human dlPFC. This
resource is interactively accessible at
http://resources.sestanlab.org/PFC/.


Transcriptomic classification of dlPFC cells in
anthropoid primates


We performed snRNA-seq on histologically 
validated adult postmortem dlPFC samples
from neurotypical male and female humans,
chimpanzees, rhesus macaques, and common
marmosets (Fig. 1A, fig. S1A, and table S1).
Each species group included four donors with 
four technical replicates per donor, which sub- 
stantially increased analytic power (fig. S1, B 
to D). Analysis with the consensus genome 
annotation (fig. S2; materials and methods), 
followed by stringent quality control, yielded 
transcriptomic data from 610,719 high-quality 
nuclei comparably distributed across the four 
species (fig. S1E). 

Based on unsupervised clustering, expres- 
sion profiling of marker genes, and tran- 
scriptomic integration with published human 
and mouse single-cell and spatial transcriptomic
datasets (16-24) (materials and methods),
we defined three levels of hierarchically related
taxonomy of neuronal, glial, and non-neural
subtypes (Fig. 1B and fig. S3A). Level 1 included
four major classes with 209,824 glutamatergic 
excitatory neurons, 101,845 GABAergic inhib- 
itory neurons, 247,660 glial cells, and 51,390 
non-neural cells. Level 2 included 29 sub- 
classes identified as 10 excitatory neuron sub- 
classes characterized by their layer and axon 
projection properties (intratelencephalic, extratelencephalic,
near-projecting, corticothalamic,
and L6B); nine inhibitory neuron subclasses
characterized by established marker genes;
four glial cell subclasses (astrocytes, oligodendrocyte
precursor cells, oligodendrocytes, and
microglia); and six non-neural cell subclasses
(immune cells, endothelial cells, red blood
lineage cells, pericytes, smooth muscle cells,
and vascular leptomeningeal cells). Level 3
included 114 further refined subtypes within
each of the subclasses. These subtypes have a
balanced contribution from donors in each species
and their transcriptomic stability is generally
not affected after randomly removing
one donor in a given species (fig. S3, A and B).

Fig. 1. Transcriptomically defined cell taxonomy of anthropoid primate
dlPFC. (A) Species (red, human; blue, chimpanzee; green, macaque; yellow,
marmoset) and homologous dlPFC areas (blue) analyzed using snRNA-seq.
(B) Overview of the snRNA-seq dataset, depicting information from
inner- to outermost as follows: UMAP of all nuclei; dendrogram of
hierarchically related subtypes; subtype proportions; marker gene
expression; species representation in each subtype (color conforming to
panel A); 4 classes; 29 subclasses; and 114 refined subtypes. The subtypes
detected only in some species are highlighted. See fig. S3A for a more detailed
description of the cell taxonomy. IT, intratelencephalic; ET, extratelencephalic; NP,
near-projecting; CT, corticothalamic; ChC, chandelier cells; Astro, astrocyte; Micro,
microglia; Oligo, oligodendrocytes; OPC, oligodendrocyte precursor cells; Endo,
endothelial cells; RB, red blood lineage cells; PC, pericyte; SMC, smooth muscle cells;
VLMC, vascular leptomeningeal cells.
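
The class and subclass counts quoted in the text can be checked against the reported total of 610,719 nuclei and 29 subclasses. The short Python sketch below only re-adds the numbers given in the text; the dictionary names are illustrative and are not taken from the authors' code.

```python
# Level 1 class sizes (number of nuclei), as reported in the text.
level1_counts = {
    "glutamatergic excitatory neurons": 209_824,
    "GABAergic inhibitory neurons": 101_845,
    "glial cells": 247_660,
    "non-neural cells": 51_390,
}

# Number of Level 2 subclasses per class, as reported in the text.
level2_subclasses = {
    "excitatory": 10,
    "inhibitory": 9,
    "glial": 4,
    "non-neural": 6,
}

total_nuclei = sum(level1_counts.values())
total_subclasses = sum(level2_subclasses.values())

# Share of each class among all high-quality nuclei.
fractions = {name: n / total_nuclei for name, n in level1_counts.items()}

print(total_nuclei)      # 610719
print(total_subclasses)  # 29
for name, frac in fractions.items():
    print(f"{name}: {frac:.1%}")
```

Both totals agree with the figures stated in the text (610,719 nuclei and 29 subclasses).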

All 29 subclasses and most of the 114 sub- 
types were detected in all four primate species 
(Fig. 1B). Cell homologies were unaffected when
integrating data using different methods (fig.
S3C) or clustering nuclei within each species
separately (figs. S11A, S13A, and S16A). However,
five subtypes were detected only in some
species (hereafter referred to as species-specific 
subtypes), with minimal bias being introduced 
by donors of that given species (fig. S4, A to D). 
The five subtypes included the excitatory neuron 
subtype L2-3 CUX2 ARHGAP18 absent in 
marmosets, the inhibitory neuron subtype 
InN LAMP5 SYT10 detected only in marmosets,
the astroglial subtype Astro AQP4 OSMR
detected only in humans and chimpanzees, 
the microglial subtype Micro P2RY12 CCL3 
detected only in humans, and the microglial 
subtype Micro P2RY12 GLDN detected only 
in humans and chimpanzees. 
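
The detection pattern of the five subtypes can be restated as a small presence table and mapped onto the phylogenetic groups named earlier in the text. This is an illustrative sketch only: the `classify` helper and the dictionary names are hypothetical and are not part of the authors' pipeline.

```python
# Species in which each species-specific subtype was detected, as listed in the text.
detected_in = {
    "L2-3 CUX2 ARHGAP18": {"human", "chimpanzee", "macaque"},
    "InN LAMP5 SYT10": {"marmoset"},
    "Astro AQP4 OSMR": {"human", "chimpanzee"},
    "Micro P2RY12 CCL3": {"human"},
    "Micro P2RY12 GLDN": {"human", "chimpanzee"},
}

# Groups of sampled species; membership follows the text
# (Hominini: human, chimpanzee; Catarrhini: Hominini plus macaque).
CLADES = {
    "human-specific": {"human"},
    "Hominini-specific": {"human", "chimpanzee"},
    "Catarrhini-specific": {"human", "chimpanzee", "macaque"},
    "marmoset-specific": {"marmoset"},
}


def classify(species_set):
    """Return the clade label whose sampled members match the detection pattern exactly."""
    for label, members in CLADES.items():
        if species_set == members:
            return label
    return "other"


labels = {subtype: classify(species) for subtype, species in detected_in.items()}
for subtype, label in labels.items():
    print(f"{subtype}: {label}")
```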


Cellular and transcriptomic changes across
anthropoid primate dlPFC subtypes


The multifaceted functions of the dlPFC require
the orchestration of diverse cell types
that have undergone substantial specializa- 
tion, which could include the emergence of 
new cell subtypes, changes in the abundance 
of a shared or conserved cell type, variation 
in intercell heterogeneity, or reorganization 
of molecular features in conserved cell types 
(Fig. 2A). 

We first assessed whether the five species- 
specific subtypes (Fig. 2B) were transcrip- 
tomically unique for those species. For this, we 
calculated the markers for each species-specific 
subtype and visualized their expression en- 
richment across species on the uniform mani- 
fold approximation and projection (UMAP) 
layout (fig. S4, A to D). The homologous cells
of each species-specific subtype, if present
in one species, were found to be marked by
high enrichment scores and clustered together
on the UMAP layout (fig. S4, A to D). By contrast,
no cells or only a scarce number of cells
sparsely distributed on the UMAP were labeled
by high enrichment scores in the species in
which the species-specific subtypes were absent.
By leveraging available cross-species
snRNA-seq data (22, 25), we confirmed the
species-specific presence of subtypes L2-3 CUX2
ARHGAP18, InN LAMP5 SYT10, and Micro
P2RY12 CCL3 and demonstrated that the detection
of species-specific subtypes is not limited to
the donors analyzed in this study (figs. S4E and
S17F). To further corroborate these results, we
conducted single-molecule fluorescence in situ
hybridization for representative marker genes
and validated the species-specificity of the
Catarrhini-specific upper layer intratelencephalic
subtype L2-3 CUX2 ARHGAP18, the
marmoset-specific inhibitory neuron subtype
InN LAMP5 SYT10, and the Hominini-specific
enrichment in astroglial subtype Astro AQP4
OSMR (figs. S5 to S7). Similar to the observation
in the snRNA-seq data (fig. S4), we observed
only a limited number of cells labeled
by in situ hybridization in the species lacking
the species-specific subtypes.

Fig. 2. Shared and divergent features of dlPFC subtypes across anthropoid
primates. (A) Diagram illustrating possible models of species cellular and
transcriptomic differences. (B) Subtype abundance comparisons across species.
Each pie represents a subtype with size indicating the average subtype proportions
across the four species (materials and methods). Links represent the expression
correlation among subtypes. Subtypes showing species-enrichment in abundance
are opaque in color whereas semitransparent or muted colors indicate no
enrichment in abundance. Species-specific subtypes are highlighted by dashed
circles. (C) Transcriptomic heterogeneity among cells of the same subclasses.
Significance was tested with a pairwise Wilcoxon Rank Sum test with Bonferroni
correction (** adjusted P < 0.01). (D) Raw transcriptomic divergence across
subtypes (columns) in all species pairs (rows). IT, intratelencephalic; ET,
extratelencephalic; NP, near-projecting; CT, corticothalamic; ChC, chandelier cells;
Astro, astrocyte; Micro, microglia; Oligo, oligodendrocytes; OPC, oligodendrocyte
precursor cells; Endo, endothelial cells; RB, red blood lineage cells; PC, pericyte;
SMC, smooth muscle cells; VLMC, vascular leptomeningeal cells; huMicro,
human-specific microglia (Micro P2RY12 CCL3); hoMicro, Hominini-specific microglia
(Micro P2RY12 GLDN); rAstro, reactive astrocyte (Astro AQP4 OSMR).

Fig. 3. Taxonomy and divergent features of dlPFC excitatory
neuron subtypes. (A) UMAP visualization of all glutamatergic
excitatory neuron subtypes with subclasses labeled. (B) (Top)
UMAP showing all L6 IT-2 subtypes. (Bottom) Expression of
subtype markers shared in Catarrhini (≥2 species) or specific
to marmosets. (C) (Top) Enrichment of Gene Ontology (GO)
terms in the species-enriched genes from each species, with
colors indicating the species. Only the GO terms enriched in at least
three species are displayed.


Although most subtypes are shared across 
the four analyzed primates, changes in their 
relative abundance can contribute to distinct 
cellular networks. An L2-3 intratelencephalic 
subtype (L2-3 CUX2 ACVR1C THSD7A), the
most abundant subtype among all L2-3 intratelencephalic
subtypes, was found to be enriched
in humans compared with other species and 
enriched in Catarrhini compared with mar- 
mosets (Fig. 2B and fig. S8). By contrast, we 
observed a notable reduction of the L5 extratelencephalic
subtype and an L6 corticothalamic
subtype in Catarrhini compared with marmosets
(Fig. 2B).

Differences in cellular diversity across spe- 
cies can also arise when cell subpopulations 
within a subclass become transcriptomically 
more diversified or more homogenous within 
a given species. To delineate transcriptomic 
heterogeneity among cells within a given sub- 
class, we adopted the concept of entropy, a 
measurement of uncertainty, in analyzing gene 
expression variability (fig. S9A; materials and 
methods). Larger entropy values are associated with
higher gene expression variability and vice versa.
By subtracting the entropy in permuted data
from that in the actual data, we were able to
reveal the overall transcriptomic heterogeneity 
among cells within each homologous subclass 
across species (Fig. 2C). We found a considerable 
heterogeneity increase in Catarrhini compared 
with marmosets in multiple neuronal and glial 
subclasses such as L2-3 intratelencephalic, PVALB 
chandelier cells, and astrocyte subclasses (Fig. 
2C). Such heterogeneity arising in Catarrhini 
L2-3 intratelencephalic excitatory neurons confirms
previous reports (16, 26) and extends our
observations of expanded diversity and abun- 
dance in upper layer neurons in Catarrhini 
(Fig. 2B). Patterns of increased heterogeneity 
in Catarrhini were further supported by the 
augmented cell separation in principal component
dimensions and the expansion of marker
expression profiles distinguishing subtypes
in Catarrhini (fig. S9, B and C). Although
many of these subtype markers shared across 
Catarrhini were also expressed broadly in 
marmosets, their expression was not distin- 
guishable among the same subtypes (fig. S9, D 
and E). This result suggests a possible mech- 
anism underlying cell type heterogeneity where- 
by genes originally shared among different 
subtypes become variable, resulting in greater 
intersubtype transcriptomic differences. 
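
As a toy illustration of why entropy tracks expression variability, the Python sketch below computes the Shannon entropy of a binned expression vector. It is a minimal sketch under stated assumptions, not the authors' formulation (which is described in fig. S9A and the materials and methods): the binning scheme, the `binned_entropy` helper, and the synthetic data are all hypothetical, and the permutation baseline mentioned in the text is not reproduced here.

```python
import numpy as np


def binned_entropy(values, bins=8, value_range=(0.0, 8.0)):
    """Shannon entropy (in bits) of the histogram of one gene's expression across cells."""
    counts, _ = np.histogram(values, bins=bins, range=value_range)
    probs = counts[counts > 0] / counts.sum()
    # Sum of p * log2(1/p); equals 0 when all cells fall into a single bin.
    return float((probs * np.log2(1.0 / probs)).sum())


rng = np.random.default_rng(0)

# Synthetic example data (not from the study).
uniform_gene = np.full(1000, 2.5)            # identical expression in every cell
variable_gene = rng.uniform(0.0, 8.0, 1000)  # expression spread over the whole range

h_uniform = binned_entropy(uniform_gene)
h_variable = binned_entropy(variable_gene)

print(f"identical expression: {h_uniform:.3f} bits")
print(f"variable expression:  {h_variable:.3f} bits")
```

With eight bins the entropy is bounded above by log2(8) = 3 bits, so the variable gene approaches that bound while the invariant gene scores zero.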

Species differences can also arise from diver- 
gent gene expression in homologous types. We 
defined transcriptomic divergence based upon 
expression correlation of highly variable genes 
between homologous subtypes for each species- 
wise pair (materials and methods). Observed 
patterns of transcriptomic divergence mirrored 
evolutionary distances between the four species 
with the human-chimpanzee and chimpanzee- 
marmoset pairs showing the smallest and 
largest transcriptomic divergence, respectively 
(Fig. 2D). This analysis also showed that glial
cell and non-neural cell subtypes were the
most divergent in all species pairs (Fig. 2D), 
consistent with studies comparing primate 
species at the level of bulk tissue and cell- 
sorted populations (27, 28). We confirmed that 
these observations were not influenced by 
unequal cell numbers across subtypes or using 
a different highly variable gene set (fig. S10, A 
and B) and were commensurate with species 
differences measured by the proportion of differentially
expressed genes (fig. S10C). Together,
these analyses revealed that multiple sources of variation
contributed to neuronal, glial, and non-neural 
cell type differences across anthropoids. 
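
A correlation-based divergence of the kind described above can be sketched as follows. The text states only that divergence was based on expression correlation of highly variable genes between homologous subtypes; taking it as one minus the Pearson correlation is an assumption made here for illustration, and the profiles are synthetic, with larger noise standing in for larger evolutionary distance.

```python
import numpy as np


def transcriptomic_divergence(profile_a, profile_b):
    """One minus the Pearson correlation of two subtype-level expression profiles."""
    r = np.corrcoef(profile_a, profile_b)[0, 1]
    return float(1.0 - r)


rng = np.random.default_rng(1)
n_genes = 500  # stands in for a set of highly variable genes

# Synthetic profiles (not from the study): a shared baseline plus species noise.
baseline = rng.normal(size=n_genes)
human = baseline + rng.normal(scale=0.1, size=n_genes)
chimpanzee = baseline + rng.normal(scale=0.1, size=n_genes)
marmoset = baseline + rng.normal(scale=0.8, size=n_genes)

d_close = transcriptomic_divergence(human, chimpanzee)
d_far = transcriptomic_divergence(chimpanzee, marmoset)

print(f"human vs chimpanzee:    {d_close:.3f}")
print(f"chimpanzee vs marmoset: {d_far:.3f}")
```

In this toy setup the closely related pair yields the smaller divergence, mirroring the ordering reported in the text for the human-chimpanzee and chimpanzee-marmoset pairs.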


Conserved and divergent features of 
excitatory neuron subtypes 


We next investigated each major cell class to 
better delineate conserved and divergent features 
among subtypes. Excitatory neurons in each 
species comprised a multitude of
subtypes predicted to have distinct properties
(Fig. 3A and fig. S11A). Imputation of layer distribution
and long-range projection types by
integrating our data with published snRNA-seq
data from human middle temporal gyrus
(MTG) (16), spatial transcriptomic data from
human PFC (18), and data from mouse primary
visual cortex including projection identity
(17) showed that excitatory neuron subtypes can
be classified into 10 subclasses. These subclasses
are characterized according to their predicted
L2 to L6 layer localization and projection types
(Fig. 1B and fig. S11, B to D). The intratelencephalic
subclass that accounted for more than
80% of the excitatory neurons was predicted to
span layers L2 to L6 and was marked by layer-specific
markers (L2-3, PCDH8; L3-5, RORB;
L6, OPRK1; Fig. 1B). The extratelencephalic,
corticothalamic, and other subclasses were pre- 
dicted to be localized in deep layers and were 
transcriptomically distinct (L5 extratelencephalic, 
FEZF2, BCL11B, and POU3F1; L6 corticothalamic,
SYT6; L5-6 near-projecting, HTR2C; L6B, CTGF; 
Fig. 1B). 

Assessment of subclass heterogeneity across 
species revealed one L6 intratelencephalic sub- 
class (L6 IT-2) having the greatest increase of 
intercell transcriptomic differences in Catarrhini 
compared with marmosets (Fig. 2C). In line with 
this observation, marmoset subtypes of this
subclass were less separated on the UMAP and 
expressed fewer genes distinguishing them 
when compared with humans, chimpanzees, 
and macaques (Fig. 3B). Genes delineating the 
subtypes of this subclass in Catarrhini also 
mapped onto important biological pathways 
including extracellular matrix and cell-cell interaction
(ADAMTS17, CDH18), calcium ion
binding proteins (KCNH7, KCND2), and protein
glycosylation (GALNT18, GALNTL6), suggesting
more diversified cellular functions.

We next identified transcriptomic changes 
restricted to each excitatory neuron subtype 
(fig. S12A). Within each species, we identified 
more species-enriched genes than species- 
depleted genes. As expected, marmosets had 
both more up- and downregulated genes com- 
pared to humans, chimpanzees, and macaques. 
Gene ontology enrichment of the exclusively 
enriched genes for each species converged 
upon processes related to extracellular matrix 
and calcium ion binding functions involved in 
synaptic transmission (Fig. 3C). For instance, 
genes within the annexin family, which play a 
critical role in calcium ion binding, were used 
differentially across primate brains: humans 
selectively expressed ANXA1 in upper-layer
intratelencephalic neurons whereas maca- 
ques specifically expressed ANXA3 in deep- 
layer intratelencephalic neurons. This analysis 
also highlighted human L3-5 intratelencephalic 
subclass enrichment of PKD2L1, which encodes 
a calcium-regulated cation channel. These re- 
sults suggest that the components of the same 
biological pathways are differentially recruited 
across species, resulting in divergent molecular
and cellular functions.

We further filtered genes with species- and 
subclass-restricted expression patterns using 
more stringent criteria (materials and methods), 
resulting in 20 to 51 genes in each species with 
potentially higher functional relevance (Fig. 3D 
and fig. S12B). The 29 genes detected in the 
human lineage included genes encoding transcription
factors (TWIST2 and ELK3), genes associated
with autism spectrum disorder (ASD)
in the SFARI gene list (https://gene.sfari.org/;
CCDC88C, MET, ANXA1) as well as genes encoding
critical retinoic acid signaling components
(ALDH1A1; Fig. 3D and fig. S12, C and D),
which have recently been shown to regulate 
gene expression, spinogenesis, and connectiv- 
ity in PFC (29). We also found other retinoic 
acid signaling-associated genes exhibiting di- 
vergent expression across species: CYP26B1, 
enriched in Hominini L5-6 near-projecting and 
L6 corticothalamic subclasses, and CBLN2, 
globally upregulated in multiple Hominini 
excitatory neuron subclasses. These findings 
validate recently reported Hominini-specific 
changes in the retinoic acid-responsive en- 
hancer predicted to upregulate CBLN2 expres- 
sion in PFC excitatory neurons (30). 


Conserved and divergent features of 
interneuron subtypes 


Based on marker gene expression and tran- 
scriptomic integration with other human and 
mouse single-cell RNA-seq data (12, 16, 17, 19, 20)
(Fig. 4A, fig. S13, B to D, and fig. S14, A to C), 
we classified inhibitory neurons into nine 
subclasses predicted as follows: long-range 
projecting neurons (SST NPY), Martinotti and 
non-Martinotti cells (SST), basket cells (PVALB), 
chandelier cells (PVALB ChC), Ivy cells (LAMP5
LHX6), LAMP5 neurogliaform cells (LAMP5
RELN), VIP inhibitory neurons (VIP), homologs
of human tyrosine hydroxylase (TH)-expressing
inhibitory neurons (SST HGF), and homologs
of mouse Sncg inhibitory neurons (ADARB2
KCNG1). We detected two PVALB chandelier
subtypes marked by distinct expression profiles, 
showing specific enrichment of NMDA gluta- 
mate receptors regulating excitatory synaptic 
communication as well as nonclustered protocadherins
mediating cell adhesion (fig. S14,
D and E). This pattern was observed in all four
primates analyzed (fig. S14, D and E), suggesting
conserved functional diversification between 
the two PVALB chandelier subtypes. 

These inhibitory neuron subtypes were predicted
to have different developmental origins
based on lineage-related marker expression
[medial ganglionic eminence (MGE), LHX6;
caudal ganglionic eminence (CGE), ADARB2,
NR2F2, and PROX1; figs. S13D and S14, A and
B]. We found that the LAMP5 LHX6 subclass
consisted of two subtypes marked by LHX6 
expression, among which one subtype also 
expressed PROX1 (LHX6+/PROX1+; fig. S13D). 
The LAMP5 LHX6 subclass was previously re- 
ported to be enriched in primates (12, 16) and 
was considered to be derived from MGE (31). 
We found that the markers of the LHX6+/ 
PROX1+ subtype showed expression enrich- 
ment in CGE-derived inhibitory neurons whereas 
the markers of the LHX6+/PROX1- subtype 
showed enrichment in MGE-derived inhibi- 
tory neurons (fig. S14F). This pattern was con- 
served across all four studied primates (fig. 
S14F), suggesting a potential overlap of PROX1 
and LHX6 molecular mechanisms commonly 
present in all primates analyzed. 

Fig. 4. Taxonomy and divergent features of dlPFC inhibitory neuron sub- 
types. (A) UMAP layout of all GABAergic inhibitory neuron subtypes with 
subclasses labeled. ChC, chandelier cells. (B) (Left) Sankey plots showing 
correspondence between predicted laminar organization and subtypes and/or 
subclasses. (Right) SST expression across putative cortical layers. Signifi- 
cance was tested with a one-sided Wilcoxon Rank Sum test (***, P < 0.001; 
ns, not significant). (C) Colocalization of SST and RELN (arrowhead) 
in human dlPFC L1 revealed by double RNA in situ hybridization. Scale bar, 
20 µm. (D) (Left) Log-normalized expression of TH. (Middle) Double 
immunostaining for SST and TH. (Right) Proportion of SST+ and SST- cells 
within TH+ neurons. Scale bars, 20 µm in all species. (E) Immunofluorescence 
staining for TH and SST (left) or DDC (right), combined with SST RNA in situ 
hybridization. Scale bars, 20 µm. (F) Proportion of SST+ and SST- cells in 
human TH+ neurons. 

Imputation of layer distribution by tran- 
scriptomic integration with human MTG in- 
hibitory neurons (fig. S14G) (16) showed that 
SST and PVALB inhibitory neuron subtypes 
were preferentially located from L2 to L6 
whereas L1 inhibitory neurons included only 
ADARB2 KCNG1, VIP, and LAMP5 subtypes 
(Fig. 4B). Some putative L1 inhibitory neuron 
subtypes in humans also expressed SST (Fig. 
4B and fig. S14G) but they lacked expression of 
other MGE-related genes including LHX6 and 
PVALB. However, few putative L1 inhibitory 
neurons from other species expressed SST 
(Fig. 4B). We confirmed the presence of these 
inhibitory neurons by performing in situ hybri- 
dization in human tissue against SST and RELN, 
a gene marking human L1 SST-expressing in- 
hibitory neurons in the snRNA-seq data (Fig. 
4C). These results suggest the presence of SST- 
expressing inhibitory neurons in human L1. 
Within homologous inhibitory neuron sub- 
types we detected pronounced transcriptomic 
changes across species that followed patterns 
similar to those of excitatory neurons, with 
marmosets having the most distinct expres- 
sion profiles, commensurate with phyloge- 
netic relationships of the four primates (fig. 
S14H). MELTF—which encodes melanotrans- 
ferrin and is involved in cellular iron uptake— 
was specifically upregulated in PVALB chandelier 
subtypes in humans (fig. S14H), suggesting 
novel roles for iron homeostasis in fast-spiking 
inhibitory neurons. Analysis of gene families 
(https://www.genenames.org/; materials and 
methods) overrepresented in species-enriched 
genes highlighted collagens in almost all species 
(fig. S14I), a pattern also observed in excitatory 
neurons (Fig. 3C). This analysis also revealed 
genes encoding neuropeptides (e.g., NMU and 
PENK) and carbonic anhydrases overrepre- 
sented in human-enriched genes; the former 
is a critical component orchestrating cell-cell 
communications, and the latter can regulate 
neuronal signaling through modulation of pH 
transients (fig. S14I). Cumulatively, these results 
suggested possible reorganization of synap- 
tic communication across primate inhibitory 
neurons. 
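
To make the idea of a gene family being "overrepresented" among species-enriched genes concrete, the following is a minimal sketch of a one-sided hypergeometric tail probability. It is an assumption for illustration only: the procedure actually used is the one given in the materials and methods, and the function name `hypergeom_sf` and every count below are hypothetical.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """Return P(X >= k) for a hypergeometric draw.

    N: background genes, K: genes in the family,
    n: species-enriched genes, k: observed overlap.
    """
    upper = min(K, n)
    # Sum exact integer counts first, then divide once to limit rounding error.
    favorable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favorable / comb(N, n)


# Hypothetical counts, not taken from the paper: a 40-gene family,
# 200 species-enriched genes, 20,000 background genes, 5 genes in both.
p_value = hypergeom_sf(k=5, N=20000, K=40, n=200)
print(f"one-sided enrichment p-value: {p_value:.2e}")
```

With these made-up counts the expected overlap by chance is 0.4 genes, so an observed overlap of 5 gives a small tail probability. In a real analysis the p-values would also need correction for the number of gene families tested.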


Human-specific switching between expression 
of SST and TH in interneurons 


We have previously found that inhibitory neu- 
rons expressing TH—the rate-limiting enzyme in 
the biosynthesis of catecholamine neurotrans- 
mitters such as dopamine—are absent from 
the neocortex of nonhuman African apes (i.e., 
chimpanzees, bonobos, and gorillas) (32, 33). 
Given the functional significance of dopamine 
in PFC function (7), we further profiled these 
inhibitory neurons using our dataset. We found 
TH expression in a small subset of dlPFC in- 
hibitory neurons in humans and macaques. Ex- 
pression was notably lower in marmosets and 
absent in chimpanzees. Human and macaque 
TH-expressing inhibitory neurons coexpressed 
SST and were transcriptomically classified as 
the SST HGF subclass that was hierarchically 
split into two subtypes: InN LHX6 HGF STON2 
subtype (low TH expression) and InN SST HGF 
GABRQ subtype (high TH expression) (Fig. 
4D). Homologous subtypes were also found in 
chimpanzees (Fig. 4D), indicating that these 
inhibitory neurons are not absent in chimpan- 
zees but simply do not express TH. Expression 
of TH was also detected in the two homolo- 
gous subtypes in marmosets and some VIP- 
expressing inhibitory neurons but generally 
showed much lower expression compared with 
humans and macaques (Fig. 4D), indicating a 
more complex pattern of species-specific reg- 
ulation of TH expression in these inhibitory 
neurons. 

Since these two subtypes are readily distin- 
guishable by levels of SST expression (high in 
subtype InN SST HGF GABRQ and extremely 
low in subtype InN LHX6 HGF STON2; fig. 
S15A), we characterized these two subtypes by 
immunofluorescence staining for TH and SST. 
TH-immunopositive inhibitory neurons were 
found in deep layers and the adjacent white 
matter of dlPFC in humans, macaques, and 
marmosets—but not in chimpanzees (Fig. 4D). 
Many TH-immunopositive inhibitory neu- 
rons were also co-immunolabeled for SST in 
macaques (47.4%) and marmosets (24.6%; Fig. 
4D). The two populations or states of TH- 
immunopositive inhibitory neurons distin- 
guished by SST immunolabeling in macaques 
likely represent the two TH-expressing sub- 
types detected in our snRNA-seq data (fig. 
S15A). Furthermore, the lower proportion of 
TH-immunopositive inhibitory neurons co- 
immunolabeled for SST in marmosets may 
be due to the TH expression in marmoset VIP 
inhibitory neurons (Fig. 4D). Consistent with 
this, a subset of marmoset TH-immunopositive 
inhibitory neurons also expressed VIP in upper 
layers (28.1%; fig. S15C). 

By contrast, only a few TH-immunopositive 
inhibitory neurons were co-immunolabeled 
for SST in humans (2.4%; Fig. 4D). Even with 
tyramide signal amplification we could only 
detect trace SST-immunolabeling signals in 
human TH-immunopositive inhibitory neurons 
(fig. S15, D to E). Such low SST-immunopositivity 
among human TH-immunopositive inhibitory 
neurons was inconsistent with our snRNA-seq 
data as one human TH-expressing subtype 
(InN SST HGF GABRQ) exhibited prominent 
SST RNA expression comparable to the homo- 
log subtype in macaques and marmosets (fig. 
S15A). Such inconsistency prompted us to inves- 
tigate SST RNA expression in TH-immunopositive 
inhibitory neurons using additional tissue 
samples and methods. We complemented the 
immunofluorescence findings with in situ 
hybridization-based detection of SST RNA in 
human dlPFC tissue samples and found that 
most human TH-immunopositive cells expressed 
SST RNA transcripts (83.2%; Fig. 4, E and F), as 
predicted by our snRNA-seq data. This propor- 
tion is notably greater than the proportion of 
inhibitory neurons co-immunolabeled for both 
TH and SST (2.4%), suggesting that certain post- 
transcriptional mechanisms exist in humans to 
downregulate SST protein expression. Addition- 
ally, we profiled SST and TH immunohisto- 
chemistry in the mouse frontal cortex, which 
revealed a proportion of inhibitory neurons 
co-immunolabeled for both SST and TH (57.9%; 
fig. S15F), similar to macaques (47.4%) but not 
humans (2.4%). Together, these results indicate 
that switching between expression of SST 
and TH protein, a pattern reminiscent of pop- 
ulations of interneurons in the adult rat hypo- 
thalamus that switch between SST and TH 
expression, with subsequent dopamine produc- 
tion (34), is also present in certain dlPFC in- 
terneurons in a human-specific manner, at least 
in the context of the primates that we analyzed. 

Because TH expression itself does not neces- 
sarily effectuate active dopamine production 
or signaling, we profiled other 
dopamine signaling pathway components in 
our dlPFC dataset (fig. S15, A to B). We found 
that many genes orchestrating key dopamine 
signaling pathway processes, including dopa- 
mine synthesis (TH, GCH1), transport (SLC18A2), 
and binding to presynaptic receptors (DRD2) 
(7), were expressed in the human subtype InN 
SST HGF GABRQ. However, these genes ex- 
hibited either low or no expression in the other 
subtype InN LHX6 HGF STON2 or in other 
analyzed species (fig. S15A). DDC was not 
detected in the human InN SST HGF GABRQ 
subtype likely because of its low expression, 
which we could detect in the matched inhi- 
bitory neuron subtype in the human MTG 
data (Inh L5-6 SST TH; fig. S15A), which has 
higher sequencing depth using SMART-seq 
(16). In situ hybridization also confirmed the 
presence of RNA transcripts for both TH and 
DDC in the same human dlPFC cells that 
exhibited typical interneuronal morphology 
(fig. S15K). We further validated the protein 
expression of DDC in TH+/SST+ inhibitory 
neurons by performing double immuno- 
fluorescent staining for TH and DDC combined 
with in situ hybridization for SST in human 
dlPFC tissue samples (Fig. 4E). By contrast, in 
mice, DDC immunolabeling was detected in 
putative dopaminergic neurons in the substan- 
tia nigra and ventral tegmental area but not in 
neocortical TH-immunopositive inhibitory neu- 
rons (fig. S15L). Although expression of SLC6A3 
was not detected in either our dlPFC or pub- 
lished MTG single cell dataset, SLC47A1, which 
encodes a noncanonical dopamine transpor- 
ter (35), was exclusively expressed in both TH- 
expressing inhibitory neuron subtypes in humans 
(fig. S15A). These results suggest the emergence 
of key dopamine signaling pathway compo- 
nents in a subset of TH-expressing inhibitory 
neurons in humans. 


Conserved and divergent features of glial and 
non-neural subtypes 


Within glia cells and non-neural cells, the ob- 
served taxonomy of oligodendroglial, astroglial, 
microglial, immune, and vascular cells corre- 
sponded to 28 subtypes in humans, 27 subtypes 
in chimpanzees, 25 subtypes in macaques, and 
25 subtypes in marmosets (Fig. 5A and fig. 
S16A). Although these subtypes were labeled by 
conserved subclass markers (fig. S16B), promi- 
nent species-specific expression profiles were 
detected in each subtype with a pattern con- 
sistent with that of the phylogenetic relation- 
ship among the four primates (Fig. 5B). 
Subtypes related to the oligodendrocyte line- 
age represented a cellular differentiation and 
maturation trajectory spanning from oligoden- 
drocyte precursor cells to mature oligodendro- 
cytes (fig. S16C). This trajectory was shaped by 
conserved gene expression cascades across the 
four primates and transcriptomically resembled 
that described in mice (27) (fig. S16, C and D). 
However, prominent species differences were 
also observed, with transcriptomic divergence 
and individual gene expression differences 
among the four primates increasing along the 
trajectory (Fig. 5C). Evolutionarily divergent 
genes along the trajectory were particularly 
enriched for ligand and receptor binding features 
such as semaphorins and their neuropilin and 
plexin receptors (fig. S17, A and B), which 
highlights the role of semaphorin signaling 
in regulating oligodendrocyte precursor cell 
recruitment and differentiation as well as 
oligodendrocyte myelination. 

Fig. 5. Taxonomy and divergent features of dlPFC glia cell and non-neural 
cell subtypes. (A) UMAP showing all glia cell and non-neural cell subtypes in the 
four primates with subclasses labeled. (B) Standardized average expression of 
genes (rows) which are species-enriched (top) or species-depleted (bottom) 
across subtypes (columns). Subtypes are color-barcoded on the top with color 
scheme conforming to panel A. Genes were colored according to the subtypes 
where they show enrichment or depletion. (C) Transcriptomic divergence (top) 
and gene expression differences (bottom) of subtypes in the oligodendrocyte 
lineage between each pair of species. HS, human; PT, chimpanzee; MM, 
macaque; CJ, marmoset. (D) (Left) The two gene modules differentially 
expressed among the three interlaminar astrocyte subclusters. (Right) Expres- 
sion of the selected genes across the interlaminar and protoplasmic astrocyte 
subclusters. (E) (Top) UMAP plots illustrating microglia and immune subtypes 
detected in the four primates. (Bottom) Expression of manually selected subtype 
marker genes. Marker expression labeling species-specific subtypes are high- 
lighted by dashed rectangles. (F) (Left) Illustration of the brain vascular 
architecture from arteries (red), arterioles, and capillaries to veins (blue). (Right) 
Vascular subtypes divided into endothelial (left) and mural cells (pericytes and 
smooth muscle cells) (right) located along the arterial-venous axis. Dot plots 
show expression of marker genes and FLT1 signaling genes. Astro, astrocyte; 
Micro, microglia; Oligo, oligodendrocytes; OPC, oligodendrocyte precursor cells; 
Endo, endothelial cells; PC, pericyte; SMC, smooth muscle cells; VLMC, vascular 
leptomeningeal cells; huMicro, human-specific microglia subtype (Micro P2RY12 
CCL3); hoMicro, Hominini-specific microglia subtype (Micro P2RY12 GLDN). 

Based on marker gene expression and im- 
putation of layer distribution by integration of 
our data with human MTG and motor cortex 
data (16, 22), astrocytic subtypes were charac- 
teristic of known astrocyte subpopulations, in- 
cluding interlaminar astrocytes localizing in 
L1, protoplasmic astrocytes populating the gray 
matter, fibrous astrocytes residing in the white 
matter, and reactive astrocytes (fig. S16, E and H) 
(36). Because there is limited characterization of 
interlaminar astrocytes we further subclustered 
these cells to probe their transcriptomic heter- 
ogeneity in greater detail. The resulting three 
subclusters displayed substantial expression 
differences in 612 genes organized into two 
coexpression modules (Fig. 5D). Module 
1 included many genes encoding the major 
effectors for astrocyte-mediated clearance of 
glutamate and GABA from the extracellular 
space, such as glutamate transporters (SLC1A2 
and SLC1A3) and GABA transporters (SLC6A11 
and SLC6A1; Fig. 5D). Preferential expression 
of these genes in a subset of interlaminar 
astrocytes, juxtaposed to their broad expres- 
sion in protoplasmic astrocytes, suggests a 
role in the clearance of neurotransmitters in 
cortical L1 by a subset of interlaminar astro- 
cytes that adopted machineries similar to 
those of protoplasmic astrocytes. 

Subtypes of microglia, macrophages, myeloid 
cells, and T and B cells formed the immune 
populations in these data (fig. S16A) (37). Al- 
though we found broad conservation of im- 
mune subtypes across species we highlighted 
one microglia subtype specific to humans (Micro 
P2RY12 CCL3) and one microglia subtype 
specific to Hominini (Micro P2RY12 GLDN; 
Fig. 5E). The human-specific microglial sub- 
type expressed signatures of preactivation (FOS, 
CD83), chemokine secretion (CCL3, CCL4), anti- 
viral defense (CH25H), costimulation (CD86), 
and proliferation (BTG2, CDKN1A; Fig. 5E) and 
was very similar to a previously reported pre- 
active immune-sensing subpopulation, which 
emerges as early as the 10th gestational week 
and expands to midgestation in the absence of 
inflammation (38). We show that this cell type 
likely persists into adulthood in the human 
cerebral cortex (fig. S17, E and F) (25). Its pre- 
sence was also unrelated to sample age (table 
S1 and fig. S4H). The early developmental rise 
and persistence of this cell type, independent 
of different pathological states, may suggest 
a specific and essential immune homeostatic 
and/or senescence-associated role in contrast 
to specifically induced disease-associated micro- 
glial subtypes (25, 37). The Hominini-specific 
microglial type showed high expression of 
GLDN, MYO1E, PADI2, and PPARG (Fig. 5E). 

In addition to microglia, we identified 1416 
putative T cells, with most (885 or 62.5%) from 
human samples (fig. S16I), the abundance of 
which is consistent across the human donors 
analyzed (8.24% ± 1.66% among all immune 
cells; fig. S4I). This number greatly exceeded 
those of myeloid and B cells (81 and 53 cells, 
respectively), surpassing proportions in peri- 
pheral circulation that would be expected for 
contaminants rather than a brain-resident T cell 
population. Such cells likewise had expression of 
multiple genes not detected during T cell de- 
velopment in the human thymus (fig. S16K). 
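
The counts quoted in this paragraph can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated in the text (1416 putative T cells, 885 of them human, 81 myeloid cells, 53 B cells); the variable names are illustrative.

```python
# Cell counts as reported in the paragraph above.
total_t_cells = 1416
human_t_cells = 885
myeloid_cells = 81
b_cells = 53

# Share of putative T cells that came from human samples.
human_share = human_t_cells / total_t_cells
print(f"Human share of putative T cells: {human_share:.1%}")

# How many times more abundant T cells are than the other two populations.
print(f"T cells per myeloid cell: {total_t_cells / myeloid_cells:.1f}")
print(f"T cells per B cell: {total_t_cells / b_cells:.1f}")
```

The first line reproduces the 62.5% given in the text. The two ratios only restate the raw counts and say nothing by themselves about whether the cells are brain-resident.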

Through combinatorial marker gene expres- 
sion and transcriptomic integration with mouse 
snRNA-seq data (23, 24), we found that vascular 
subtypes were predicted to locate along the 
arterial-arteriolar-capillary-venous axis including 
three endothelial subtypes (arterial, capillary, 
and venous endothelial cells), four mural subtypes 
(pericytes, arterial, arteriolar, and venous smooth 
muscle cells), three vascular leptomeningeal- 
like cell subtypes, and one putative red blood 
cell lineage subtype (fig. S16L; materials and 
methods). Some genes involved in key biologi- 
cal pathways displayed species-specific profiles 
along this vascular axis. This is exemplified by 
FLT1, which encodes the vascular endothelial 
growth factor receptor, with conserved expres- 
sion marking the capillary endothelial subtype. 
Among the three genes encoding its signaling 
molecules, VEGFA was sparsely expressed in 
pericytes of all species whereas VEGFB and 
PGF were enriched in human pericytes (Fig. 
5F), suggesting the presence of human-specific 
signaling shaping vascular cell identity and 
communication. 


Primate- and human-specific expression 
patterns of FOXP2 


By assessing the intersection of genes showing 
human-enriched expression in microglia and 
those showing microglia-enriched expression 
compared with macrophages, we found 19 
genes. FOXP2 was of particular interest among 
those genes given that it encodes a transcrip- 
tion factor mutated in developmental verbal 
dyspraxia (39, 40), and is implicated in a 
number of neuropsychiatric disorders. Although 
previous studies have shown that FOXP2 is 
expressed in neurons of the basal ganglia and 
neocortical layers 6 and 5 (40), its expression in 
microglia is unexpected. Some FOXP2-related 
disorders have been linked to alterations in 
microglia and neuroinflammation (37), high- 
lighting the potential functional importance 
of human-specific FOXP2 expression in the 
context of microglia. We next performed in 
situ hybridization for FOXP2 RNA, combined 
with immunofluorescent staining against IBA1, 
and validated FOXP2 expression in human 
microglia (Fig. 6B). Similarly, we confirmed 
its protein expression in human microglia by 
performing triple-labeling immunofluorescence 
for RBFOX3 (NeuN), IBA1, and FOXP2 in human 
tissue, which revealed FOXP2 expression in 
both RBFOX3-positive neurons and IBA1-positive 
microglia (fig. S18A). Using multiple published 
datasets, we independently validated human 
specificity of FOXP2 expression in microglia 
and found that it was a common feature shared 
by multiple brain regions (fig. S18, B to E); these 
datasets included bulk tissue RNA-seq data of 
sorted microglia from the cerebral cortices of 
eight species (human, macaque, marmoset, 
sheep, mouse, rat, hamster, and chicken; fig. 
S18B) (25), snRNA-seq data from mouse primary 
visual cortex (Fig. 6A) (17), cross-species snRNA- 
seq data of motor cortex (human, marmoset, 
and mouse; fig. S18C) (22), and cross-species 
snRNA-seq data of hippocampus (human, maca- 
que, pig, and mouse; fig. S18D) (41). By contrast, 
FOXP2 expression was not detected in macro- 
phages in brain or other tissues (fig. S18F) nor 
in fetal microglia (fig. S18, G and H). 
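
The candidate list at the start of this section comes from intersecting two gene sets: genes with human-enriched expression in microglia, and genes with microglia-enriched expression relative to macrophages. A minimal sketch of that set operation follows. Only FOXP2 is a real member named in the text; every other gene name is a placeholder, and the real intersection contained 19 genes.

```python
# Placeholder gene sets. Only FOXP2 is named in the text; GENE_* are hypothetical.
human_enriched_in_microglia = {"FOXP2", "GENE_A", "GENE_B", "GENE_C"}
microglia_enriched_vs_macrophages = {"FOXP2", "GENE_B", "GENE_D"}

# Genes satisfying both criteria, sorted for a stable display order.
candidates = sorted(human_enriched_in_microglia & microglia_enriched_vs_macrophages)
print(candidates)
```

Requiring both criteria filters out genes that are human-enriched in microglia only because they are broadly expressed across myeloid cells.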

We found that FOXP2 is highly expressed 
in several major L5 and L6 excitatory neuron 
subclasses across primates in our data (Fig. 6A), 
which matches previous studies in mice (40). 
However, we also observed previously unknown 
enrichment of FOXP2 expression in L3-5 intra- 
telencephalic excitatory neurons across prima- 
tes, which was not detected in mice (Fig. 6A). 
The L3-5 intratelencephalic excitatory neurons 
likely encompass small neurons that give the 
primate dlPFC L4 its characteristic granular 
appearance (with a high density of small ex- 
citatory neurons). To test whether FOXP2 is 
expressed in neurons of the granular L4, we 
conducted immunohistochemistry in the four 
primates analyzed in this study (Fig. 6C and 
table S3). We found that the majority of neu- 
ronal nuclei in granular L4 of the dlPFC were 
immunolabeled for FOXP2. To test whether this 
expression pattern is specific to primates, the 
only mammals with granular dlPFC, we ex- 
tended our immunohistochemical study of 
FOXP2 expression to 21 primates and 30 non- 
primate mammals. This represented a total of 
19 orders, including monotremes (represented 
by echidna), the lineage with the most ancient 
divergence from other mammals (table S3), 
and allowed us to identify the evolutionary 
conservation of this expression profile. We 
found that the pattern of prominent L4 FOXP2 
expression was only observed in primates (Fig. 
6D and figs. S19 and S20A) and was notable in 
granular PFC and other analyzed granular as- 
sociation areas involved in higher integrative 
functions (table S3). Overall, this indicates that 
broad expression of FOXP2 in L4 excitatory 
neurons is an evolutionary specialization of 
primates. 

To assess temporal expression patterns that 
might inform developmental regulation of 
primate FOXP2 specificity in L4, we performed 
immunohistochemistry on frontal cortices from 
fetal, perinatal, and adult humans, and observed 
FOXP2-immunopositive nuclei in L5 and L6 (fig. 
S20B) across all analyzed developmental pe- 
riods. However, human L4 FOXP2 immuno- 
reactivity did not emerge until after 10 days 
postnatal (fig. S20B), suggesting that FOXP2 is 
expressed well after L4 neurons have reached 
their laminar position and myelination and syn- 
aptogenesis are at their peaks (42). This suggests 
that FOXP2 expression in L4 is likely controlled by transcrip- 
tional programs that differ from those control- 
ling FOXP2 expression in L5 and L6 excitatory 
neurons. 


Shared and cell type-specific FOXP2 
regulatory mechanisms 


To identify possible regulatory mechanisms 
and the functional significance underlying the 
cell type- and species-specific FOXP2 expres- 
sion patterns, we next performed sn-multiome 
(snATAC-seq and snRNA-seq), profiling chro- 
matin accessibility and gene expression simul- 
taneously on the dlPFC of five additional 
neurotypical male and female adult human 
donors (table S1). Following stringent qual- 
ity control we obtained transcriptome data 
from 56,938 nuclei and chromatin accessi- 
bility data from 41,591 nuclei (fig. S21, A and 
B). With the human snRNA-seq data as the 
reference we were able to define the same cell 
subclasses that were similarly distributed across 
donors (fig. S21B). 

Through investigation of chromatin acces- 
sibility and physical contacts (43) in microglia 
from the sn-multiome data and multiple in- 
dependent studies, we found that one cis- 
regulatory element topologically interacted 
with FOXP2 and was exclusive to microglia (fig. 
S21, C to D). Similarly, using the sn-multiome 
data we identified several cis-regulatory elements 
proximal to FOXP2 selectively enriched in L3-5 
intratelencephalic excitatory neurons (fig. S21D 
and table S4). 

Fig. 6. Species- and cell type-specific expression of FOXP2. (A) Violin plots 
of FOXP2 expression across all subclasses in the four primates and mouse 
neocortex (17). (B) Immunofluorescent staining against IBA1 (red) combined with 
RNA in situ hybridization for FOXP2 RNA (green) in L6 of human dlPFC. Arrowheads 
indicate FOXP2+/IBA1+ microglia, whereas the arrow indicates FOXP2-/IBA1+ 
microglia. The pie chart summarizes the proportion of FOXP2-expressing cells among 
IBA1-immunopositive cells. Nuclei are stained with 4',6-diamidino-2-phenylindole 
(DAPI) (blue). Scale bar, 20 µm. (C) Representative images of FOXP2 immunohisto- 
chemistry throughout cortical columns in human, chimpanzee, mouse, and echidna. 
The position of layers and white matter (WM) are indicated. Insets for echidna 
highlight numerous FOXP2-immunopositive nuclei in the striatum and scarce 
FOXP2-immunopositive nuclei in the deep neocortical layers. Scale bars, 100 µm 
for all species. (D) Phylogeny dendrogram of the 51 mammals and their corresponding 
laminar distribution of FOXP2-immunopositive nuclei. 

By integrating the DNA coaccessibility and 
gene coexpression, we constructed gene reg- 
ulatory networks connecting upstream regu- 
lators that positively regulate FOXP2 as well 
as downstream targets that are either positively 
or negatively regulated by FOXP2, in each of 
the FOXP2-expressing subclasses (fig. S22A 
and table S5). Whereas the upstream regu- 
lators scarcely overlap between subclasses, the 
downstream targets are substantially shared 
among excitatory neuron subclasses (fig. S22, 
B and C). This analysis highlighted NFIA, a 
previously reported FOXP2 cofactor (44) having 
both upstream and downstream regulatory 
roles in L5-6 non-intratelencephalic excitatory 
neurons (fig. S22C). Furthermore, DSCAM, 
which is implicated in Down syndrome, was 
predicted to be positively regulated by FOXP2 
in microglia and exhibited human-specific ex- 
pression in microglia as well (fig. S22, C and D). 
Several genes (PHLDB2, PLCH1, and IL1RAPL2), 
which are positively regulated by FOXP2 in 
L3-5 intratelencephalic excitatory neurons, 
also displayed primate-specific expression in 
these neurons compared to mice (fig. S22, C 
and D). Among these targets, IL1RAPL2 has 
been reported to play a key role in dendritic 
spine formation (45). Using an independent 
algorithm identifying the FOXP2-related reg- 
ulon (a module consisting of a transcription 
factor and its targets) across all cell subclasses 
rather than in a subclass-wise manner, we 
found that the FOXP2-regulon ranked as the 
top regulon of those specific to L3-5 intra- 
telencephalic neurons (fig. S22, E and F). This 
suggests a central role for FOXP2 in the tran- 
scriptional regulation in L3-5 intratelencephalic 
excitatory neurons (fig. S22G). 

To validate the downstream targets in 
excitatory neurons independently, we elec- 
troporated either GFP or FOXP2 (human or 
mouse sequence) into the ventricular walls of 
fetal mice (embryonic day 14.5), followed by 
RNA-seq analysis of neocortices at postnatal 
day 7. From this we were able to validate 
several targets positively regulated by FOXP2, 
including PHLDB2, CALCRL, CNR1, and CALD1 
as well as SHISA6, which is negatively regu- 
lated by FOXP2 (fig. S22C and table S7). Taken 
together, our results highlight cellular, phy- 
logenetic, and developmental features in the 
cortical expression pattern of FOXP2 as well 
as potential regulatory mechanisms that might 
underlie its functional relevance to speech, lan- 
guage, and multiple neuropsychiatric disorders. 


Certain brain disorder risk genes exhibit cell 
type- and species-specific expression 

The species-divergent expression of the neu- 
ropsychiatric disease risk gene FOXP2 promp- 
ted us to characterize the expression of 906 
genes previously associated with 12 major 
brain disorders systematically in this setting 
(materials and methods). By calculating the 
coefficients of variation of gene expression 
across species we found that the risk genes of 
many disorders such as ASD, developmental 
delay disorder, and epilepsy are more conserved 
than background genes (fig. S23A). Based on 
gene expression conservation these genes were 
parcellated into conserved (320 genes, 35%) or 
divergent (586 genes, 65%) categories (fig. S23B), 
followed by coexpression network analysis with- 
in each category to group these genes into dif- 
ferent modules. In the divergent category, we 
identified multiple modules exhibiting phylo- 
genetic group- or species-specific (including 
human-specific) patterns (fig. S23C and table 
S8). For example, two Parkinson’s disease risk 
genes (GBA and GOSR2, module 22) that regulate 
the degradation and secretion of alpha-synuclein 
displayed Hominini-enriched expression in a 
subset of excitatory neuron and inhibitory neuron 
subclasses (fig. S23D). 
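
The conserved/divergent split described above rests on a per-gene coefficient of variation (standard deviation divided by mean) across species. The sketch below shows one way such a classification could be computed. It is an assumption for illustration: the expression values, the gene names, the choice of population standard deviation, and the cutoff `CV_THRESHOLD` are all hypothetical, and the paper's actual criteria are in its materials and methods.

```python
from statistics import mean, pstdev


def coefficient_of_variation(values):
    """Return the population standard deviation divided by the mean."""
    m = mean(values)
    if m == 0:
        return 0.0  # assumption: treat all-zero expression as non-variable
    return pstdev(values) / m


# Hypothetical mean expression per species
# (human, chimpanzee, macaque, marmoset).
expression = {
    "GENE_A": [5.0, 5.1, 4.9, 5.0],  # similar in every species
    "GENE_B": [8.0, 2.0, 1.0, 0.5],  # strongly species-dependent
}

CV_THRESHOLD = 0.25  # illustrative cutoff, not taken from the paper


def classify(expression, threshold=CV_THRESHOLD):
    """Label each gene 'conserved' or 'divergent' by its cross-species CV."""
    labels = {}
    for gene, values in expression.items():
        cv = coefficient_of_variation(values)
        labels[gene] = "conserved" if cv < threshold else "divergent"
    return labels


labels = classify(expression)
print(labels)
```

Dividing by the mean makes the measure scale-free, so highly and lowly expressed genes can be compared on the same footing. It also becomes unstable when the mean is close to zero, which is why the zero case is handled explicitly here.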

To gain a more detailed view into the cell 
type-specific expression of the 586 genes with 
species-divergent expression we performed dif- 
ferential expression analysis across subclasses. 
Of those genes, 157 (27%) exhibited subclass- and 
species-specific expression patterns (table S9). 
These are exemplified by the multiple sclerosis 
risk gene MERTK, enriched in human L3-5 
IT excitatory neurons, and the ASD risk gene 
CACNA1D, enriched in human microglia. 
MERTK showed conserved expression in 
microglia, consistent with its known function 
in myelination equilibrium regulation by mod- 
ulating microglia activities (fig. S23F). However, 
its human-specific expression in L3-5 intra- 
telencephalic neurons suggests an unchar- 
acterized role in human excitatory neurons. 
Similarly, the ASD risk gene NR4A2, which 
encodes a transcription factor, exhibited human- 
specific expression in smooth muscle cells 
(fig. S23, E and F), suggesting possible re- 
organization of transcriptional regulation in 
anthropoids. 


Conclusions 


We generated a comprehensive transcriptomic 
survey of dlPFC neuronal, glial, and non-neural 
cells in the most commonly studied anthropoid 
primates. We also revealed and characterized 
species differences in transcriptomic profiles, 
composition, and diversity of cell types, includ- 
ing those associated with evolutionary special- 
izations and functionality of dlPFC. 

Although most cell subtypes are shared 
across the four species studied, we identified
five species-specific subtypes. These subtypes
showed balanced proportions in all donors 
of a given species and some were detected in 
independent studies, corroborating the robust- 
ness of identifying cellular differences across 
species. Nevertheless, detection of a subtype 
in a subset of species does not necessarily
imply species specificity; it could instead
reflect differences in cell
type abundance or individual variability across
species. To address this, we performed marker
gene enrichment and in situ hybridization (figs. 
S4 to S7) and revealed that even in species where
we did not detect a cell subtype, rare cells expressing
markers of those subtypes may exist.
Although such results still substantiated spe- 
cies differences, species-specificity can be better 
resolved by profiling more cells and donors. 
Alternatively, species specificity might repre- 
sent different cell states of homologous sub- 
types, and their species-specific presence could 
be associated with differences in living environments
and lifestyles between these species. This
might in turn relate to baseline inflammation differences
across species, which is particularly
important for immune populations such as
microglia and astrocytes.

Furthermore, we found that certain genes 
associated with key biological pathways exhibited 
cell type- and species-specific expression dif- 
ferences, featured by the switch of SST and TH 
expression in a subset of human inhibitory 
neurons. A possible mechanism underlying 
the posttranscriptional regulation of SST expression
is that mRNA sequence changes lead
to gain or loss of regulation by ribosome-binding
proteins. Consistent with this, SST
protein sequences are highly conserved
in primates and the mRNA sequences included
one nonsynonymous mutation specific to
humans and multiple other synonymous mutations
specific to apes (fig. S15, G to J).

These putative dopaminergic interneurons 
have been previously described (32) but their 
molecular characterization and developmen- 
tal origin have remained unknown. Our study 
demonstrated that these interneurons express 
SST as well as other key genes that are es- 
sential for dopaminergic function and likely 
originate in the medial ganglionic eminence. 
Notably, it has been reported that humans— 
as well as other great apes—have an enrich- 
ment of dopaminergic afferents in the prefrontal 
cortex (33) and our results suggest that humans 
have an extra population of dopaminergic in- 
terneurons in these cortical areas. This poten- 
tial increase in dopamine levels might have 
functional relevance for the evolution of human 
cognitive abilities and behavior (46). 

In addition, notable molecular differences
across species were unveiled in many
neuropsychiatric disease risk genes, exemplified by
human-specific FOXP2 expression in microglia
and primate-specific FOXP2 expression in
L4 excitatory neurons. Although such expression
patterns are associated with cell type-specific
cis-regulatory elements, future studies
are needed to determine their species
specificity. Because FOXP2 encodes a transcription
factor, its species-specific expression also leads to
species-specific transcriptional changes in the
L4 excitatory neurons or microglia (fig. S22),
including upregulation of IL1RAPL2, which is involved
in dendritic spine formation, and DSCAM, which is
implicated in Down syndrome. Together, our results
establish these data as a resource for
future studies in selecting primate models or
developing human-specific platforms to model
diseases.


Materials and methods summary 


In the snRNA-seq experiment, the brain cell 
nuclei were isolated according to our previous 
protocol (10, 13, 41), and the subsequent nuclei
capture and library preparation were conducted
using Chromium Single Cell 3′ Solution v3. The
snRNA-seq reads were preprocessed with CellRanger
using the consensus genome built in this
study based on reciprocal exon liftOver. Doublet 
removal was carried out using scrublet (47) 
and custom AUCell-based (48) doublet detec- 
tion scripts. Downstream batch correction, di- 
mension reduction and cell clustering were 
performed via Seurat (49). Cell subtype sepa- 
rability was measured based on AUROC scores 
evaluating cell type identity prediction in downsampled
data where one donor was randomly
removed. Detection of cell type markers
and species-specific genes was performed using the
Wilcoxon rank sum test. Cell cluster identity
annotation was based on marker gene expres- 
sion and transcriptomic comparisons with 
multiple published data sets (16-24). 
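
As an illustration of the statistics named above, the following minimal Python sketch computes the Wilcoxon rank sum statistic for one gene between an in-group and an out-group of cells and converts it to an AUROC through the standard identity AUROC = U / (n1 × n2). It is not the pipeline used in this study, which relied on Seurat; the function names and the toy expression values are hypothetical, and no P value is computed.

```python
def rank_with_ties(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum_auroc(in_group, out_group):
    """Return (W, U, AUROC) for in_group versus out_group.

    W is the Wilcoxon rank sum of in_group, U the Mann-Whitney statistic,
    and AUROC = U / (n1 * n2) the probability that a random in-group cell
    has higher expression than a random out-group cell (ties count 0.5).
    """
    n1, n2 = len(in_group), len(out_group)
    ranks = rank_with_ties(list(in_group) + list(out_group))
    w = sum(ranks[:n1])
    u = w - n1 * (n1 + 1) / 2
    return w, u, u / (n1 * n2)


# Hypothetical normalized expression of one gene in two groups of cells.
subtype_cells = [5.0, 6.0, 7.0]
other_cells = [1.0, 2.0, 3.0]
w, u, auroc = rank_sum_auroc(subtype_cells, other_cells)
print(f"W={w}, U={u}, AUROC={auroc}")
```

An AUROC near 1 marks a gene that separates the subtype almost perfectly, whereas a value near 0.5 indicates no separation.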

In assessing the global cellular and tran- 
scriptomic changes across species, AUCell-based 
(48) cell type marker enrichment was used 
to evaluate species-specificity of the detected 
species-specific subtypes. Cell subtype abundance
comparisons were carried out using the
scCODA algorithm (50), transcriptomic heterogeneity
was assessed via custom
scripts based on Shannon entropy, and transcriptomic
divergence was defined as 1 minus the Pearson
correlation coefficient.
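
The two quantities defined in this paragraph can be sketched in a few lines of Python. This is a minimal illustration rather than the custom scripts used in the study: the text does not state which distribution the entropy was computed over, so the count vector below is an assumption, and all function names and values are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def transcriptomic_divergence(profile_a, profile_b):
    """Divergence as defined in the text: 1 minus the Pearson correlation."""
    return 1.0 - pearson_r(profile_a, profile_b)


def shannon_entropy(counts):
    """Shannon entropy, in bits, of a vector of non-negative counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)


# Hypothetical mean expression of four genes in one subclass of two species.
human = [5.0, 3.0, 0.5, 8.0]
chimpanzee = [4.8, 3.1, 0.7, 7.5]
print(transcriptomic_divergence(human, chimpanzee))

# Assumed use of entropy: cells of one subclass spread over four clusters.
print(shannon_entropy([25, 25, 25, 25]))
```

Under this definition, divergence ranges from 0 for perfectly correlated profiles to 2 for perfectly anticorrelated ones, and entropy is highest when cells are spread evenly across categories.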

In the sn-multiome experiment, brain cell
nuclei were isolated using a comparable modification
of our previous protocol (10, 13, 41), followed by
library construction using Chromium Single 
Cell Multiome ATAC + Gene Expression (10x 
Genomics PN 1000283). The sn-multiome data 
was preprocessed with CellRanger and MACS2 
(51) followed by downstream filtering and 
dimension reduction using Seurat and Signac 
(49, 52). Cell type annotation was achieved via 
the label transfer algorithms in Seurat using 
the human snRNA-seq data as the reference. 
Detection of differentially accessible peaks 
was performed using the logistic regression 
implemented in Seurat. FOXP2 regulatory 
networks were constructed via scanpy (53), 
cicero (54), CellOracle (55) and pySCENIC (56) 
by integrating the DNA coaccessibility and 
gene coexpression information. 

In the mouse in utero electroporation experiment,
either FOXP2 or GFP was electroporated
into the ventricular walls of embryonic day 14.5 mice,
and RNA-seq was performed on neocortices at
postnatal day 7. Analysis of the RNA-seq data
included read alignment via STAR (57), read
counting via HTSeq (58), and differential gene
expression analysis via DESeq2 (59).

In analysis of FOXP2 cortical expression
across species, immunohistochemistry for 
FOXP2 was conducted in 21 primate and 30 
nonprimate mammals with subsequent quan- 
tification of FOXP2 laminar distribution. 

In the validation of species-specific subtypes, 
RNA in situ hybridization was performed with 
RNAscope on human, chimpanzee, and marmo- 
set dorsolateral prefrontal cortices with probes 
targeting LAMP5 and CD8A, CUX2 and PRLR, 
or AQP4 and CHI3L1. 

In characterization of SST expression in TH- 
expressing interneurons across species, immu- 
nohistochemistry of TH and SST was performed 
in human, chimpanzee, macaque, and mar- 
moset dorsolateral prefrontal cortices and the 
percentage of SST+/TH+ interneurons was 
quantified. RNA in situ hybridization with 
SST probe was combined with immunohisto- 
chemistry of TH and SST protein in human 
dorsolateral prefrontal cortices utilizing RNAscope
codetection.


Aqueously altered igneous rocks sampled
on the floor of Jezero crater, Mars


K. A. Farley et al. 


INTRODUCTION: The Perseverance rover landed 
in Jezero crater on Mars on 18 February 2021 
with three scientific objectives: to explore the 
geologic setting of the crater, to identify an- 
cient habitable environments and assess the 
possibility of past martian life, and to collect 
samples for potential transport to Earth for 
analysis in laboratories. In the 290 martian 
days (sols) after landing, Perseverance ex- 
plored rocks of the Jezero crater floor. 
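
Because a sol is slightly longer than an Earth day, the 290-sol campaign spans about 298 Earth days. The Python sketch below works through that conversion. The mean sol length (88,775.244 s) is the standard value; the landing time of 20:55 UTC and the helper names are assumptions introduced for illustration and are not taken from the article.

```python
from datetime import datetime, timedelta, timezone

SOL_SECONDS = 88775.244  # mean martian solar day: 24 h 39 min 35.244 s
EARTH_DAY_SECONDS = 86400.0

# Assumed approximate touchdown time on 18 February 2021 (UTC).
LANDING_UTC = datetime(2021, 2, 18, 20, 55, tzinfo=timezone.utc)


def sols_to_earth_days(sols):
    """Convert a duration in sols to Earth days."""
    return sols * SOL_SECONDS / EARTH_DAY_SECONDS


def utc_after_sols(sols, start=LANDING_UTC):
    """Return the UTC time that falls `sols` martian days after `start`."""
    return start + timedelta(seconds=sols * SOL_SECONDS)


print(round(sols_to_earth_days(290), 2))  # about 297.97 Earth days
print(utc_after_sols(290).date())         # mid-December 2021
```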


RATIONALE: Jezero, a 45-km-diameter crater, 
was selected for investigation by Perseverance 
because orbital observations had shown that 
it previously contained an open-system lake, 
prior to ~3.5 billion years ago. Major climate 
change then left Mars in its current cold and 
dry state. On Earth, broadly similar environ- 
ments of similar age to Jezero contain evidence 
of microbial life. Jezero crater contains a well- 
preserved delta, identified as a target for 
astrobiological investigation by the rover. Per- 
severance landed ~2 km away from the delta, 
on rocks of the crater floor. Previously proposed
origins for these rocks have ranged from
lake (or river) sediments to lava flows. Olivine- 
rich rocks identified on the crater floor, as well 
as in the area surrounding Jezero, have pre- 
viously been attributed to a widely distributed 
impact melt or volcanic deposit, variably al- 
tered to carbonate. We used Perseverance to 
investigate the origin of the crater floor rocks 
and to acquire samples of them. 


RESULTS: The Jezero crater floor consists of 
two geologic units: the informally named 
Maaz formation covers much of the crater 
floor and surrounds the other unit, which is 
informally named the Séitah formation. Maaz 
rocks display a range of morphologies: struc- 
tureless boulders, flagstone-like outcrops, 
and ridges that are several meters high. The 
ridges expose prominent layers, ranging in 
thickness from a few centimeters to a few 
tens of centimeters. Rocks of Séitah are often 
tabular and strongly layered, with layer thick- 
nesses ranging from centimeters to meters. 
Maaz and Séitah rocks display no outcrop
or grain-scale evidence for transport by wind
or water.


Sample collection by Perseverance on Mars. This image mosaic was acquired by the WATSON camera on the
rover's robot arm. Rock cores were drilled from the two holes (arrow) in an igneous rock of the Maaz formation. The
6-cm-long, 1.3-cm-diameter cores were sealed into individual sample tubes and are now stored inside the rover.

Perseverance observations show that the 
Maaz rocks consist of 0.5- to 1-mm interlocking 
crystals of pyroxene and plagioclase. Combined 
with bulk chemical composition measure- 
ments, this suggests Maaz is an igneous unit 
that cooled slowly. In contrast, most Séitah 
rocks are very rich in magnesium and are dom- 
inated by densely packed 2- to 3-mm-diameter 
crystals of olivine, surrounded by pyroxene. 
These properties indicate settling and accu- 
mulation of olivine near the base of a thick 
magma body, such as an intrusion, lava lake, 
or thick lava flow. Ground-penetrating radar 
indicates that Séitah rocks dip beneath the 
Maaz formation. We hypothesize that Maaz 
could be the magmatic complement to the 
Séitah olivine-rich rocks or, alternatively, Maaz 
could be a series of basaltic lavas that flowed 
over and around the older Séitah formation. 

The olivines in the Séitah formation are 
rimmed with magnesium-iron carbonate, likely 
produced by interaction with CO₂-rich water.
Maaz formation rocks contain an aqueously 
deposited iron oxide or iron silicate alteration 
product. Both units commonly contain patches 
of bright-white salts, including sodium per- 
chlorate and various sulfate minerals. Although 
both rock units have been altered by water, 
preservation of the original igneous minerals 
and the absence of aluminous clay minerals 
indicate that the alteration occurred under low 
water/rock ratio and that there was little loss 
of soluble species to the surroundings. It re- 
mains unclear when these aqueous processes 
occurred and whether they relate to the lake 
that once filled Jezero. 

The exposure of the olivine-rich Séitah rocks 
at the surface, the absence of lake or river sedi- 
ment in the exploration area, and several near- 
by erosional remnant hills of delta sediment 
indicate that substantial crater floor erosion 
occurred after formation of these igneous units. 

Samples of both of these geologic units were 
collected as drill cores. The drill cores were 
stored in ultraclean sample tubes, for potential 
transport to Earth by future missions in the 
early 2030s. 


CONCLUSION: The floor of Jezero crater ex- 
plored by Perseverance consists of two dis- 
tinct igneous units that have both experienced 
reactions with liquid water. Multiple rock cores 
were collected from these units for potential 
transport to Earth and analysis in terrestrial 
laboratories.


All authors and affiliations appear in the full article online.

The Perseverance rover landed in Jezero crater, Mars, to investigate ancient lake and river deposits.
We report observations of the crater floor, below the crater’s sedimentary delta, finding that the floor 
consists of igneous rocks altered by water. The lowest exposed unit, informally named Séitah, is a 
coarsely crystalline olivine-rich rock, which accumulated at the base of a magma body. Magnesium-iron 
carbonates along grain boundaries indicate reactions with carbon dioxide-rich water under water-poor 
conditions. Overlying Séitah is a unit informally named Maaz, which we interpret as lava flows or the 
chemical complement to Séitah in a layered igneous body. Voids in these rocks contain sulfates and 
perchlorates, likely introduced by later near-surface brine evaporation. Core samples of these rocks have 
been stored aboard Perseverance for potential return to Earth. 


The Perseverance rover, part of the Mars
2020 mission, landed on the floor of
Jezero crater on 18 February 2021. Perseverance
was designed to characterize
the geology, investigate records of past
climate, and seek evidence of possible ancient 
life contained in rocks laid down when martian
surface conditions were more habitable
than they are today (1). The multiyear mission
is intended to select, collect, and document
~35 rock and soil samples for potential transport
to Earth by future missions for laboratory
analysis (1). We report results from the science
campaign that ran until mission sol (martian 
day) 290, exploring the geology and geochem- 
istry of geological units on the Jezero crater 
floor. These rocks, which form part of a re- 
gional stratigraphy, record an ancient period 
when impact cratering, volcanism, and water 
were actively shaping the surface of the planet. 
Jezero crater, with a diameter of 45 km, lies 
on the western flank of Isidis, a giant impact 
basin of Noachian age [~4.0 billion years ago 
(Ga)] (2) and northeast of Syrtis Major, a 
younger shield volcano (3). Jezero was formed
by an impact onto an early Noachian basement
unit that includes Isidis impact deposits
(3-6). The crater later hosted an open-basin 
lake, fed by a large system of river valleys 
that were active in the late Noachian (~3.6 to 
3.8 Ga) (7) to early Hesperian (>3.2 Ga) (8). 
Sediments delivered by those rivers into the 
crater lake produced a clay and carbonate- 
bearing delta, which appears to be well pre- 
served (4, 8-10). The valleys that fed Jezero 
pass through a regional stratigraphy that includes
an olivine-bearing unit, partially altered
to carbonate (11, 12), that overlies the
Noachian basement. The olivine unit is in turn 
overlain by a low-albedo crater-retaining unit, 
with spectral signatures of pyroxene (5, 13). 
Mars’ three largest impact basins are each 
surrounded by such olivine-bearing rocks (14).
The olivine unit on the southern and western 
margins of the Isidis basin is the largest con- 
tiguous olivine-rich unit on Mars, with sev- 
eral hypothesized origins: impact melt (3, 5), 
extrusive or intrusive magmatism (15-17), volcanic
ash fall (18), pyroclastic surge deposition
(19), and detrital sedimentation (20).
Rock units potentially equivalent to this
regional stratigraphy occur on the floor of 
Jezero crater (9, 21). Orbital mapping of the 
Jezero crater interior (21) before landing de- 
lineated a pyroxene-bearing unit, with a high 
density of superposed impact craters, named 
Crater Floor Fractured Rough (hereafter Cf-fr),
and an olivine-bearing unit named Crater
Floor Fractured 1 (Cf-f-1). Perseverance direct- 
ly accessed these two geologic units (Fig. 1B). 
The rocks of Cf-fr, informally named the Maaz 
formation (see supplementary text in the sup- 
plementary materials), have been interpreted 
to be lava flows (9, 22) or sandstones deposited 
by flowing water or wind (21, 23). Crater counting
on this unit constrains the ages of Jezero
crater floor rocks, but studies have yielded 
inconsistent age estimates ranging from 1.4 
to 3.45 Ga (9, 22, 24). The spatially hetero- 
geneous crater density on this unit (Fig. 1B) 
is likely the source of this age discrepancy 
and indicates that the Máaz formation experienced
a complex history of surface exposure.
Olivine-bearing rocks occur outside Jezero
crater, draping across the rim, and within the 
crater (9). The olivine-bearing rocks of the 
Cf-f-1 unit, informally named the Séitah for- 
mation, have previously been proposed to 
share an origin with the regional olivine- 
bearing unit (9, 21, 25), possibly reworked or 
chemically modified by water within Jezero 
crater (21, 23). Although the Séitah formation
has been interpreted as older than the delta 
and its associated lake (21, 23, 26), the age 
of the Maaz formation relative to the delta is 
debated. The Maaz formation has been inter- 
preted as one of the youngest units in the 
crater, postdating all delta deposition (27), or 
as substantially older, predating the delta 
(21, 23, 28). Much of the Séitah formation is 
topographically higher than the Maaz for- 
mation, but previous studies agree that it is 
nonetheless older (23, 28). 

During atmospheric entry, the Mars 2020 
spacecraft’s Terrain Relative Navigation sys- 
tem (27) autonomously selected the highest- 
probability safe landing site available, which 
was at 18.4447°N, 77.4508°E, between the 
heavily cratered Maaz terrain to the east and 
the abundant aeolian bedforms of the Séitah 
terrain to the west (Fig. 1). This location has 
been informally named the Octavia E. Butler 
landing site (OEB); it lies on a narrow strip 
of smooth topography, ~2 km southeast of the 
delta scarp and within tens of meters of 
the Séitah-Maaz geologic contact. Neither 
the cratered terrain of the Máaz formation
nor the rugged sandy terrain of the Séitah 
formation is suitable for rover operations, 
but a narrow smooth strip of Maaz rocks near 
their contact provided a traversable pathway 
for the mission’s first science campaign. In 
June 2021, after completion of hardware commissioning
and demonstration flights of the
Ingenuity helicopter, the rover embarked on
an out-and-back-again journey to the south 
and then west, with the goal of documenting 
the two formations. This route provided ac- 
cess to abundant surface exposure of Maaz 
rocks and more-limited exposures of Séitah 
bedrock, along with a scarp informally named 
Artuby ridge, which delimits the two units 
(Fig. 1C). 


Fig. 1. Orbital images of the Jezero crater floor
study area. (A) Mars Reconnaissance Orbiter
(MRO) Context Camera (CTX) base map overlain
by Mars Express High Resolution Stereo Camera
color images. The study area is on the western
side of Jezero crater, a few kilometers east of the
delta (indicated by the letter D). (B) High Resolution
Imaging Science Experiment (HiRISE) color image
of the rover's landing site at the Octavia E. Butler
landing site (OEB), and its traverse up to sol
290 (white path; each circle on the path is the end
of a sol). The locations of the four abrasion patches
are labeled in italics, and four landforms, Séitah,
Artuby ridge, the Kodiak delta remnant (60), and
the Mure outcrop, are labeled in roman. Séitah
is characterized by abundant aeolian bedforms
and northeast-southwest striking ridges. The region
east and southeast of OEB is part of the Maaz
formation, which exhibits high but variable impact
crater density. (C) Enlarged portion of (B), overlain with a geologic map showing our interpreted distribution of the Séitah and Maaz formations. Colors indicate
mapped distribution of formation members (labeled), and the formation contact is indicated by the dashed white line. The Content member is only visible at the point
indicated. Figure S5 shows stratigraphic columns associated with this map.


Rover observations

We used Perseverance to acquire colocated
textural, compositional, and mineralogical
maps, at both the microscopic and outcrop
scale, along a 2.4-km traverse of the Jezero
crater floor. Subsurface structure was imaged
with a ground-penetrating radar (29). We characterized
the natural surfaces of rock outcrops
along the rover traverse using the Mastcam-Z
multispectral stereo camera (30), the WATSON
[Wide Angle Topographic Sensor for Operations
and eNgineering (32)] microscopic camera,
and the SuperCam laser-induced breakdown
and visual-infrared spectrometers (32). These
instruments are most effective when there is
minimal coating of rock surfaces by dust and
other materials. Removal of dust is also required
for the proximity science instruments
SHERLOC (Scanning Habitable Environments
with Raman & Luminescence for Organics & 
Chemicals), a Raman and fluorescence spectrometer
(31), and PIXL (Planetary Instrument
for X-ray Lithochemistry), an x-ray fluores- 
cence spectrometer (33). To remove the dust 
coating, rocks were studied after abrading a 
circular pit of ~5 cm diameter and ~1 cm depth 
into each target. Debris was removed from 
the resulting abrasion patch using compressed 
N₂ gas (34). Proximity observations were
taken at four abrasion sites (informally named 
Guillaumes, Bellegarde, Garde, and Dourbes; 
Fig. 1B); sample collection was attempted at 
each of these sites except Garde. 


Descriptions of Maaz and Séitah formations 
Maaz 

The Maaz formation consists of blocky, mas- 
sive, and layered bedrock (Fig. 2 and fig. S1). 
Natural surfaces of Maaz formation rocks con- 
tain occasional millimeter-sized grains and 
millimeter-to-centimeter-sized voids or pits. 
Many Maaz formation outcrops appear pol- 
ished and grooved by aeolian abrasion. Others 
appear to be failing along concentric fractures, 
consistent with spheroidal weathering (fig. 
S1C). Maaz rocks are often partially covered 
by regolith, dust, and sometimes a discontinuous
purple coating or rind (Fig. 3A); rock
textures are more clearly observed on mechanically
abraded surfaces. Sedimentary structures
formed by grain transport, such as
migrating ripples and dunes, were not observed
within this formation.

The Roubion member is exposed at the 
lowest elevation along the traverse, and we 
therefore infer it to be the oldest interval of 
the Maaz sequence (fig. S5). This member is 
composed of meter-scale low-lying (<5 cm of 
relief) rock surfaces, which are polygonally 
fractured. The Guillaumes abrasion patch was 
performed on such an outcrop of the Roubion 
member; it reveals light and dark grains with 
a pervasive but heterogeneously distributed 
brown tint (Fig. 3A). In the least-brownish 
regions, we observed millimeter-sized inter- 
locking dark gray or green and light-toned 
prismatic grains, with no evidence of inter- 
granular porosity or visible cement. Although 
sometimes elongated, grains show no pre- 
ferred orientation. If this rock is igneous, 
the coarse grains suggest a moderately slow 
cooling rate (i.e., it is a holocrystalline or mi- 
crogabbroic rock). Irregularly shaped voids 
or pits, up to 5 mm in diameter, are present 
within the abrasion patch, which also con- 
tains elongated crevices. The edges of these 
features, and several other areas on the abrasion
surface, appear to be coated by a smooth
red-brown or black mantle that partially or
completely obscures the underlying grains.
Many of the lighter-toned grains are variably
stained reddish brown. Millimeter-sized irregularly
shaped patches of bright-white material,
some also stained brown, occur between grains
(likely in filled voids) and as a surface coating on
some of the prismatic grains. Collectively, these
observations are consistent with an igneous rock
that has experienced aqueous alteration (i.e.,
modification by reactions with water).


Fig. 2. Rover images of igneous rocks of the Maaz formation. All panels show enhanced-color Mastcam-Z
images. (A) Image acquired on sol 138, showing low-relief, friable outcrops of the Roubion member (mb;
foreground) overlain upslope by resistant caprocks of the Rochette member. (B) Image acquired on sol 175,
showing layered rocks of the Artuby member, which are capped by the resistant rocks of the Rochette member.
Artuby member layers range in thickness from <1 cm up to several decimeters and are generally planar. White
arrows point to rounded outcrop protrusions. (C) Image acquired on sol 282, showing layered caprock of the
Rochette member. Layers range in thickness from several centimeters to several decimeters and show an
apparent thickening-upward trend. Layers are planar and dip ~10° to the south (into the page). Source images
(39) in this figure and shown below have been white-balanced, contrast-stretched, and gamma-corrected to
improve viewability and contrast.


Fig. 3. Microscope images of abrasion patches.
(A) WATSON image of the Guillaumes abrasion
patch, on the Roubion member of the Maaz
formation, acquired on sol 160. Millimeter-sized
light- and dark-toned minerals interlock, bright-
white and brown-to-black material coats surfaces
and fills interstices, and several holes and crevices
penetrate into the rock. (B) Combined SHERLOC
and WATSON images of the Bellegarde abrasion
patch, on the Rochette member of the Maaz formation,
acquired on sol 186. Light and dark minerals
ranging from 0.5 to 1 mm in size are present, coated
or stained with brown material, but less extensively
than in Guillaumes. The black arrow points to
bright-white material, also stained brown, likely filling
a void in the rock. (C) Combined SHERLOC and
WATSON images of the Dourbes abrasion patch, on the
Bastide member of the Séitah formation, acquired on
sol 257. Light-toned grains ranging from 2 to 3 mm
in size are surrounded by a dark-green mineral.
Brown material occurs on most grain boundaries,
sometimes coating the lighter-toned grains.

Up-section and westward along Artuby 
ridge, an ~1-km-long linear scarp that reveals 2 
to 3 m of southwest-dipping stratigraphy on 
its northeastern side (Fig. 2B), the Roubion 
member transitions into the Artuby member, 
distinguished by the presence of distinct lay- 
ering. The Artuby member is composed of 
decimeter-thick layers with rounded protru- 
sions and an apparently coarse granular surface 
texture, although this is sometimes obscured 
by regolith (Fig. 2 and fig. S1). The thick layers
are separated by, and sometimes transition 
laterally into, thin recessive subcentimeter- 
scale layers. The layered rocks of the Artuby 
member are typically overlain by a resistant 
caprock, the Rochette member, that can be 
traced laterally over hundreds of meters along 
the crest of Artuby ridge. This caprock also 
appears to extend southward from the ridge 
as fragmented blocky flat-lying exposures par- 
tially covered by regolith (Fig. 1B). The Rochette 
member is ~30 to 50 cm thick and sometimes 
appears structureless but more commonly ex- 
hibits centimeter-scale layering. At an outcrop 
informally named Mure (fig. S1, D and E), the 
Rochette caprock exhibits thin layers that 
pinch out below more-massive layers and an 
~2-cm-wide chain of pits (possibly vesicles) 
that for ~75 cm traces the curvature of the 
layer in which it is enclosed. A second abra- 
sion patch, Bellegarde, was performed on a 
tabular boulder of the Rochette member, on 
the crest of Artuby ridge. Bellegarde is char- 
acterized by smaller crystals (~0.5 mm) of sim- 
ilar appearance to Guillaumes (Fig. 3). These 
crystals are occasionally stained by a reddish- 
brown material that is less pervasive than in 
Guillaumes but appears otherwise similar. 
Irregular white patches are also present, but 
there are no large pits or thick dark coatings 
in Bellegarde. 

Resistant, blocky caprocks, like those of the 
Rochette member, are also observed in several 
~1- to 2-m-high scarps along the eastern edge 
of Séitah (west and southwest of OEB). Here, 
the Rochette member appears to be overlain 
by disaggregating low-relief polygonal out- 
crops of the Nataani member. Rocks of the 
Nataani member appear morphologically and 
lithologically similar to those of the under- 
lying Roubion member but are distinct in 
their geographic extent, elevation, and in- 
ferred local stratigraphic relationships (Fig. 1B 
and fig. S5). The low-relief Nataani member 
transitions eastward and up-section into 
the boulder-forming hills of the Chal mem- 
ber. East of OEB and the southbound rover 
traverse, massive submeter-to-meter-scale 
boulders of the Chal member erode from hilly
outcrops, with no signs of internal layering.
This rough terrain extends to cover most of the 
crater floor and is associated with the highest 
crater densities observed within Jezero (Fig. 1B). 

Séitah

Bedrock exposure of the Séitah formation is 
mostly limited to ~1- to 10-m-tall ridges trend- 
ing northeast-southwest, with the remainder of 
the region covered by sand ripples, loose rocks, 
and boulders (Fig. 4 and fig. S2). The Séitah
formation comprises the strongly layered Bas- 
tide member and the overlying Content mem- 
ber, which is structureless and characterized 
by abundant millimeter-to-centimeter-sized 
pits (fig. S2A). Long-distance imaging shows 
that the Bastide member is widespread and 
exposed in outcrops throughout the several 
square kilometers of observed Séitah forma- 
tion. Most layers of the Bastide member are 
1 to 3 cm thick, but the thickest layers are 
apparently structureless over thicknesses of 
10 to 40 cm. Thinner layers sometimes tran- 
sition laterally into massive outcrops. There is 
an apparent thickening-upward trend within the 
Bastide member, in which thinly layered rocks 
transition up-section into thicker-layered rocks 
that cap ridges (fig. S2C). Most layers are 
generally tabular and even in thickness, with 
planar layer-bounding surfaces, although 
some exhibit centimeter-scale undulations.
Individual layers can sometimes be traced laterally
over distances of several meters within
an individual outcrop, but individual layers 
and distinctive sets of layers usually cannot 
be correlated between outcrops. We identi- 
fied no indicators of transport by wind or 
water, such as planar lamination and ripple
or dune cross-stratification. The pitted rocks
of the overlying Content member do not ex- 
hibit layering, and they lack olivine (as dis- 
cussed in the section on Séitah mineralogy). 
The Content member is observed only locally, 
exposed over lateral distances no greater than 
several meters. Its contact relationship with 
the Bastide member is unknown. 

Natural surfaces of Bastide member out- 
crops show densely packed, clast-supported, 
homogeneous 2- to 3-mm-diameter dark-gray 
or green grains (or crystals) with indistinct, 
lighter-toned material between grains (Fig. 
4C). The grain-size distributions do not vary 
across multiple outcrops, nor did we visually 
detect grain-scale sorting or grading at ap- 
parent layer boundaries. The millimeter-sized 
grains, when observed in abraded surfaces 
of Bastide (the Dourbes and Garde abrasion 
patches), are light toned and mostly equant, 
but some elongated grains are also present 
(Fig. 3C) (35); we observed no gradation in 
grain size in the abrasion patches. Less com- 
mon than the light-toned grains are slightly 

Fig. 4. Layered and coarse-grained appearance of the Bastide member of the Séitah formation. 
(A) Enhanced-color Mastcam-Z mosaic of Bastide, acquired on sol 201, showing bedrock exposures eroding 
into tabular boulders, which occur mostly on ridges. Bastide member bedrock varies from thinly layered 
to structureless, and individual layers cannot typically be traced more than a few meters. The platy and tabular 
appearance of the Séitah formation contrasts strongly with the more-rounded boulders of Maaz (Fig. 2A). 
(B) Enhanced-color Mastcam-Z mosaic, acquired on sol 204, of the Bastide outcrop where the Garde 
abrasion was performed. Centimeter-scale layering is accentuated by differential erosion. (C) SuperCam 
mosaic of the Cine abrasion target on the Bastide formation, acquired on sol 206. On surfaces with 
little dust or coating, the Bastide member rocks reveal abundant 2- to 3-mm angular to rounded gray-green 
grains, separated by lighter-toned material. 


30 September 2022 




smaller dark-gray or green grains that are 
sometimes angular but more often appear to 
fill the space between lighter grains. Light- 
brown material stains the light-toned grains 
and is concentrated between grains, in some 
cases encircling or rimming them. A few 
submillimeter-scale bright-white patches are 
also present. Although a few natural surfaces 
of Bastide member rocks contain millimeter- 
scale pits, the occasional voids in the two 
abraded surfaces are much smaller. 

Near the base of Artuby ridge, at the tran- 
sition between the Maaz and Séitah forma- 
tions, Séitah layering is observed both in 
outcrop and in the subsurface with ground- 
penetrating radar. The layering typically dips 
~10° to 15° to the southwest (fig. S3). These 
dips are consistent with those of Maaz for- 
mation rocks exposed along Artuby ridge. This 
indicates that Séitah formation bedrock lies 
stratigraphically below Maaz formation rocks, 
even though the Maaz formation is topograph- 
ically lower than some Séitah exposures, con- 
firming a pre-landing hypothesis (23). Thus, 
the Séitah formation is the lowest exposed 
stratigraphic unit on the Jezero crater floor, 
which is overlain by the Maaz formation. The 
boundary between the Séitah and Maaz for- 
mations is obscured by regolith, so it is un- 
known whether the units are separated by an 
unconformity. The inferred contact between 
the two units varies in elevation, by ~10 m 
along Artuby ridge and up to ~30 m around 
the entire Séitah exposure. 


Maaz formation geochemistry and mineralogy 


Mastcam-Z multispectral observations (30) of 
Maaz formation rocks along the traverse show 
broad absorption bands centered near 0.88 
to 0.90 μm, consistent with pyroxene or Fe³⁺-bearing 
alteration minerals (fig. S4). These are 
similar to the weak pyroxene bands observed 
from orbit across the eastern portion of the 
crater floor (9, 36). Seventy-seven Maaz for- 
mation targets were analyzed with SuperCam 
infrared (IR) spectroscopy (32); these spectra 
frequently show a band at 1.92 to 1.93 μm, 
which we attribute to hydrated iron oxides, 
ferric smectites, or hydrated salts (fig. S6) (37). 
Weaker absorption features observed at 1.42 
and 2.28 μm are consistent with Fe-OH bands 
in hydrated silicates (37). 

SuperCam laser-induced breakdown spec- 
troscopy (LIBS) (32) was used to characterize 
the elemental composition of 67 Maaz forma- 
tion targets. At its typical operating distance 
of 2.7 m from the rover, the laser beam interrogates 
a region ~350 μm in diameter at each 
of the 5 to 10 observation points investigated 
on each rock target. In some cases, individual 
points had elemental abundances consistent 
with specific minerals rather than mixtures. 
From these points we infer the presence of 
common igneous minerals (augite, plagioclase, 
ilmenite, ferrosilite, and unidentified iron 
oxides), at least some of which are moderately 
coarse grained (>350 μm). No IR spectra or 
LIBS compositions consistent with olivine were 
observed in Maaz formation bedrock. Figure S8 
shows the average composition of each ana- 
lyzed rock target, which indicates that the Maaz 
formation is fairly homogeneous and domi- 
nated by varying proportions of plagioclase 
and augite. The average LIBS-based compo- 
sition of Maaz (table S1) is broadly basaltic, 
with 48% SiO₂, low MgO (3 wt %), and high 
(20 wt %) FeO_T (defined as total iron as Fe²⁺). 

PIXL x-ray fluorescence mapping (33) shows 
(Fig. 5) that the Guillaumes (Roubion member) 
and Bellegarde (Rochette member) abrasion 
patches are dominated by two distinct 0.5- 
to 1-mm-scale common igneous phases. The 
blocky to prismatic white grains (Fig. 3) are 
chemically consistent with plagioclase (Al-rich 
regions in Fig. 5A), whereas the darker grains 
have the composition of pyroxene (specifi- 
cally augite, Ca-rich regions). Also present are 
100-um-scale regions indicative of FeTi oxides 
and Ca phosphates. The red-brown to black 
material consists mostly of Fe and Si, with 
molar Fe/Si ratios ranging from ~0.7 to 1.5 
and MgO and Al₂O₃ both <4%. This composition 
overlaps with some igneous Fe silicate 
minerals, such as Fe-bearing pyroxene and 
olivine, but is also consistent with several sec- 
ondary Al-free Fe silicates, such as greenalite, 
hisingerite, and cronstedtite. Alternatively, this 
material may be a mixture of iron oxide and 
silicates; Mastcam-Z multispectral data indi- 
cate the presence of hematite and/or goethite 
in both Maaz abrasion patches (fig. S4). Coupled 
with its elemental composition, the color and 
textural properties of this material (Fig. 3) 
suggest in situ aqueous alteration of one or 
more primary igneous minerals. 
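
The molar Fe/Si ratios quoted above follow from oxide weight percentages by dividing each oxide abundance by its molar mass. The sketch below is illustrative only: the function names `molar_fe_si` and `ideal_oxide_wt` are hypothetical, and the check uses ideal mineral stoichiometries (ferrosilite, FeSiO3, and fayalite, Fe2SiO4) rather than any PIXL measurement.

```python
# Molar masses in g/mol, from standard atomic weights.
M_FEO = 71.844   # FeO: one Fe per formula unit
M_SIO2 = 60.084  # SiO2: one Si per formula unit


def molar_fe_si(feo_wt: float, sio2_wt: float) -> float:
    """Return the molar Fe/Si ratio for FeO and SiO2 given in wt %."""
    if feo_wt < 0 or sio2_wt <= 0:
        raise ValueError("feo_wt must be >= 0 and sio2_wt must be > 0")
    return (feo_wt / M_FEO) / (sio2_wt / M_SIO2)


def ideal_oxide_wt(n_feo: int, n_sio2: int) -> tuple[float, float]:
    """Oxide wt % of an ideal phase built from n_feo FeO + n_sio2 SiO2."""
    total = n_feo * M_FEO + n_sio2 * M_SIO2
    return 100 * n_feo * M_FEO / total, 100 * n_sio2 * M_SIO2 / total


ferrosilite = molar_fe_si(*ideal_oxide_wt(1, 1))  # FeSiO3 end member
fayalite = molar_fe_si(*ideal_oxide_wt(2, 1))     # Fe2SiO4 end member
print(ferrosilite, fayalite)
```

Ideal ferrosilite gives a ratio of 1, inside the reported ~0.7 to 1.5 range, whereas ideal fayalite gives 2, above it.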

Raman spectroscopy indicates that the 
bright-white void-filling material in the Maaz 
formation abrasion patches is at least partially 
composed of hydrous Ca sulfate and Na per- 
chlorate, sometimes colocated with each other 
(fig. S7, A and B). Elemental abundances sup- 
port these identifications; a phase with molar 
Na/Cl ratio of ~1 is consistent with Na per- 
chlorate, halite, or both. 

In the average bulk composition of the 
Maaz abrasion patches (table S2), the abundances 
of SO₃ (~3.0 wt %) and Cl (~2.6 wt %) 
are both far higher than in typical unaltered 
martian igneous rocks (38). If Maaz consists of 
igneous rocks, they have experienced addition 
of S- and Cl-bearing materials. To estimate the 
primary chemical compositions of Guillaumes 
and Bellegarde, we computed a subset average 
of PIXL data by excluding salt-rich bright- 
white areas (Fig. 3) (39). This alteration-free 
compositional average (table S2) indicates, on 
the basis of total alkali metals and silica, that 
the primary Maaz formation rocks were basaltic 
in composition. Compared with Earth basalts, 
Maaz rocks are enriched in FeO_T (22 wt %) 
and P₂O₅ (2.2 wt %). Compositions estimated 
by x-ray fluorescence, which represent the 
analysis of just a few square millimeters of 
rock, are similar to the average composition of 
all Maaz formation rocks obtained by LIBS 
(fig. S8). 

Fig. 5. Compositional maps of Guillaumes and Dourbes abrasion patches. PIXL x-ray fluorescence 
maps indicating composition are shown in color (as indicated in the legends), with concentrations in 
weight %. These are overlain on grayscale visual images indicating rock texture. 
(A) Guillaumes is dominated by grains with compositions consistent with plagioclase (high Al, 
corresponding to the light grains visible in Fig. 3A) and augite (high Ca and Fe, corresponding to 
dark grains in Fig. 3A). Fe-rich material appears brown in Fig. 3A; it could be primary or secondary 
Fe silicates, iron oxides, both, or FeTi oxides and sulfates, as these are indistinguishable in this 
representation. (B) Dourbes is dominated by euhedral to subhedral olivine of ~Fo₅₅ composition 
(Fe-rich, corresponding to light-toned grains in Fig. 3C). Augite (high Ca, corresponding to dark-green 
grains in Fig. 3C) engulfs olivine, which a companion paper interprets as poikilitic igneous 
texture indicating an igneous olivine cumulate rock (35). High-Al regions indicate plagioclase. Sulfates 
and carbonates appear as small areas with the highest Fe. Figure S9 provides individual maps of 
Ca, Fe, and Al concentrations to aid interpretation of colors in this figure. FeO_T is total iron as Fe²⁺. 
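
The classification on the basis of total alkali metals and silica mentioned above can be sketched as a simple field check. The limits used here (45 to 52 wt % SiO2 and no more than 5 wt % Na2O + K2O for the basalt field) are the commonly used total alkali-silica diagram boundaries and are an assumption of this sketch, not values stated in the text. The function name is hypothetical and the input composition is made up for illustration; it is not taken from table S2.

```python
def is_tas_basalt(sio2_wt: float, na2o_wt: float, k2o_wt: float) -> bool:
    """True if a volatile-free composition plots in the assumed TAS basalt field."""
    total_alkalis = na2o_wt + k2o_wt
    return 45.0 <= sio2_wt <= 52.0 and total_alkalis <= 5.0


# Hypothetical composition in wt %, for illustration only.
example = {"sio2_wt": 48.0, "na2o_wt": 3.0, "k2o_wt": 0.5}
print(is_tas_basalt(**example))
```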


Séitah formation geochemistry and mineralogy 


Mastcam-Z multispectral images and SuperCam 
IR spectra of Bastide member rocks of the 
Séitah formation are dominated by a broad 
1-μm absorption feature, indicating olivine 
(fig. S4). IR spectroscopy also shows a 1.9-μm 
feature similar to, but weaker than, that ob- 
served in Maaz formation rocks, which we 
attribute to mineral hydration (fig. S6) (37). 
Individual LIBS points on the prominent 2- to 
3-mm gray-green grains observed on natural 
surfaces (Fig. 3C) commonly have composi- 
tions approaching those of pure olivine. The 
target-averaged LIBS compositions of 25 Bas- 
tide member rocks extend from olivine toward 
mixtures of augite and plagioclase (fig. S8). 
The Bastide member compositions have only 
minor overlap with the dominantly plagioclase- 
augite Maaz formation rocks. The overall 
mean composition of all Bastide member tar- 
gets is listed in table S1. 

Raman spectra of the Bastide member ab- 
rasion patches (Dourbes and Garde) confirm 
the abundant presence of olivine (fig. S7C). 
X-ray fluorescence elemental mapping (per- 
formed on Dourbes only; Fig. 5B) indicates 
that 65% of the analyzed surface, associated 
with the light-toned grains (Fig. 3), is chemically 
consistent with approximately Fo₅₅ olivine 
(i.e., 55% Mg and 45% Fe on an atom 
basis) (35). The olivine is partially surrounded 
with, and sometimes enclosed by, dark-green 
augite (13% of the analyzed surface). A com- 
panion paper (35) interprets this relationship 
as consistent with poikilitic igneous texture 
in which late-forming augite grows around 
density-segregated olivine. Al-rich areas mak- 
ing up 10% of the analyzed surface likely in- 
clude feldspars; 0.1-mm-scale phosphates and 
Cr- and Ti-bearing Fe oxides are also present. 
The reddish-brown material that occurs be- 
tween grains (Fig. 3C) is chemically complex. 
It often has Raman peaks consistent with 
carbonate, frequently associated with olivine 
peaks (fig. S7). In other instances, Raman 
spectra consistent with hydrated sulfate (fig. 
S7) yield Mg/S ratios that indicate the presence 
of MgSO₄. In elemental composition 
maps, the brown areas sometimes contain 
low SiO₂, elevated abundances of Fe and Mg, 
and low analytical oxide totals, consistent with 
carbonate with an approximate composition 
of Fe0.5Mg0.5CO3. Raman signatures of the rare 
white patches indicate the presence of hydrated 
Ca and Mg sulfates and perchlorate (fig. S7). A 
broad Raman signature at ~1020 cm⁻¹, widely 
distributed in the abrasion patch, is poten- 
tially amorphous silicate (fig. S7G). 
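
The forsterite number used above is the atomic Mg/(Mg + Fe) ratio of olivine expressed in percent. A minimal sketch follows; the function names are hypothetical, and the oxide molar masses are standard values rather than data from the paper.

```python
M_MGO = 40.304  # g/mol
M_FEO = 71.844  # g/mol


def forsterite_number(mg: float, fe: float) -> float:
    """Fo = 100 * Mg / (Mg + Fe), with Mg and Fe in atomic (molar) units."""
    if mg < 0 or fe < 0 or mg + fe == 0:
        raise ValueError("need non-negative Mg and Fe, not both zero")
    return 100.0 * mg / (mg + fe)


def forsterite_number_from_oxides(mgo_wt: float, feo_wt: float) -> float:
    """Same quantity, starting from MgO and FeO in wt %."""
    return forsterite_number(mgo_wt / M_MGO, feo_wt / M_FEO)


# 55% Mg and 45% Fe on an atom basis, as stated for the Dourbes olivine.
print(forsterite_number(55, 45))
```

Working from oxide weight percentages instead requires the molar conversion in the second function, because FeO is much heavier than MgO per cation.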

Séitah formation rocks, like those of Maaz, 
appear to consist of primary igneous minerals 
(e.g., olivine, augite, plagioclase) and second- 
ary aqueous alteration phases (carbonate, sul- 
fates). Excluding material that has likely been 
transported from elsewhere (39), the mean 
primary composition obtained by PIXL on 
Dourbes is ultramafic, with 20 wt % MgO and 
40 wt % SiO₂ (table S2). Compared with the 
Bastide member LIBS average, this composition 
has lower SiO₂ and higher FeO but is 
otherwise similar. 

In contrast to the Bastide member targets, 
LIBS measurements of the Content member 
appear to be dominated by augite and feldspar, 
with no olivine. This is consistent with the 
appearance of this rock, which lacks the coarse 
gray-green olivine grains characteristic of the 
Bastide member. Only limited data on this 
member were acquired. 


Interpretations and hypotheses 
Origins of Maaz and Séitah formations 


We interpret the Bastide member of the Séitah 
formation as igneous. The grain size and high 
modal abundance of olivine, coupled with the 
poikilitic texture, indicate an olivine cumulate 
in which pyroxene, plagioclase, and primary 
Fe-Ti-Cr oxides crystallized from residual melt 
(35). These characteristics indicate slow cool- 
ing, differentiation, and crystal settling (40). 
Therefore, Bastide member rocks likely crys- 
tallized in a thick lava flow, a lava lake or im- 
pact melt sheet, or an intrusion. On Earth, 
variations in crystal size or modal mineralogy 
define layers that often give cumulate rocks 
a sedimentary appearance in outcrop (40, 41). 
Imaging, spectroscopic, and LIBS observa- 
tions revealed no evidence for such differences 
among Bastide layers or at their interface. The 
prominently layered appearance of the Séitah 
formation (Fig. 4) could arise from layer- 
parallel cooling joints, differential aqueous 
alteration, or subtle variations in rock compo- 
sition or texture that enhance near-horizontal 
fracturing. Such features are likely to have 
been accentuated by erosional processes, espe- 
cially wind abrasion. This interpretation im- 
plies that the layering in Séitah corresponds to 
magmatic layering. 

Given their mineralogy, texture, and bulk 
composition, rocks of the Maaz formation are 
also most consistent with a primary igneous, 
rather than sedimentary, origin. We exclude 
the possibility that Maaz consists of basaltic 
sandstones owing to the lack of sorting among 
grains of different size and specific gravity, the 
interlocking grain texture, the lack of evidence 
for pervasive intergranular porosity or cements 
in the Guillaumes and Bellegarde abrasion 
patches, and the apparent absence of sub- 
aqueous or aeolian transport features. 

On Earth, the high abundance of olivine in 
olivine cumulate rocks is usually complemented 
by less-mafic (i.e., less Fe and Mg) materials, 
commonly overlying the cumulate, that, when 
combined, constitute the parental melt com- 
position (40). Rover observations within Jez- 
ero crater, and more spatially extensive orbital 
spectroscopy (36), indicate that the high oli- 
vine abundances characteristic of the Bastide 
formation are also found throughout Séitah, 
with olivine-poor rocks (such as the Content 
member) being rare. If a less-mafic complement 
to the Bastide member previously existed 
in this region, it must have been at least par- 
tially removed by erosion. The Maaz formation 
could be the remnant of such a complement— 
it has abundant pyroxene and plagioclase and 
little magnesium, is moderately coarsely crys- 
talline, and lies stratigraphically above the 
Bastide member, all of which would be ex- 
pected if the two units originated from a single 
differentiated igneous melt. The various mem- 
bers of the Maaz formation could be equiv- 
alent to layers in the igneous body. 

Alternatively, the Maaz formation could be 
a sequence of younger basaltic lavas that flowed 
up against, and at least partially overtopped, 
the Séitah formation after erosional removal 
of the putative less-mafic complement. The 
morphologically distinct Maaz members could 
then be cogenetic lava flows, with variable 
properties related to composition and/or erup- 
tion temperature. An extrusive igneous origin 
for Maaz would account for some of its mor- 
phologies and textures, for example, the lobate 
structures in the Artuby member (Fig. 2B), possible 
vesicles in Guillaumes (Fig. 3A), and the 
chain of pits in the Rochette member (Fig. 2). 
Variations in elevation of the Maaz-Séitah 
contact could reflect underlying topography 
of the Séitah formation on which the Maaz 
lavas were erupted. However, other features 
are less characteristic of lava flows, such as 
the centimeter-scale layers observed in the 
Rochette and Artuby members and the lack 
of flow orientation of elongated grains in the 
Maaz abrasion patches (Fig. 3). 

The limited data on the Content member 
of the Séitah formation make it difficult to 
define its origin. It could also be a remnant 
of a once more-expansive less-mafic comple- 
ment to Bastide; a remnant of Maaz formation 
lava flows that overtopped the Bastide mem- 
ber; or remnant sedimentary rocks deposited 
atop the Bastide member. 

Multiple sources of melting are possible for 
the igneous crater floor units within Jezero; 
present data are insufficient to discriminate 
between the possibilities. The impact that pro- 
duced Jezero could have generated a suffi- 
ciently deep melt sheet to differentiate into 
an olivine cumulate, but we have found no 
evidence of associated impact breccias on the 
crater floor, and crater-scaling relationships 
suggest that the original Jezero floor is buried 
by ~1 km of postimpact basin-filling materials 
(4, 22). Although Jezero is close to the Isidis 
impact basin, and Bastide member petrology 
is similar to hypothesized olivine-rich cumu- 
lates in a melt sheet potentially created by the 
Isidis impact (3, 5), Isidis cannot be the source 
because Jezero postdates Isidis. There are no 
other impact basins of appropriate age, size, 
and proximity to Jezero to be plausible sources. 
Alternatively, melting could have occurred 
without an impact, especially in the early 
Hesperian, when the martian mantle was 
substantially hotter than at present—this is 
thought to be the origin of the nearby Syrtis 
Major shield volcano (42). 

Magmas could have been introduced into 
Jezero crater over its rim or from below. Al- 
though the Jezero crater rim has been exten- 
sively eroded, materials like those in the 
Bastide member have been interpreted as 
draping its western side (9), and we suggest 
that hills on the southeast rim of Jezero could 
be of volcanic construction. However, these 
features are not contiguous with the Séitah 
formation. We consider the most likely sce- 
nario to be fissure-fed eruptions occurring 
both within Jezero and in the surrounding re- 
gion. By analogy to some volcanically filled 
craters on the Moon (43) and other craters on 
Mars (44), magma injection could have oc- 
curred directly into the low-density materials 
of the crater floor at any time after Jezero 
formation. This process could lead to intru- 
sive and/or extrusive igneous activity (43). 

Séitah formation rocks were previously proposed 
to be part of an ~70,000-km² olivine-rich 
region associated with the Isidis basin 
(5, 18, 45, 46). The widespread distribution 
of this olivine-rich material, apparent drap- 
ing across topography, poor crater retention, 
and limited regolith cover have been interpreted 
as indicating a clastic origin (5, 18-20). 
That interpretation is difficult to reconcile 
with our conclusion that the Séitah forma- 
tion is a slowly cooled magmatic differentiate. 
Despite their similarities, the regional oli- 
vine unit could have an origin distinct from 
that of Séitah. Alternatively, widespread in- 
jection and differentiation of composition- 
ally similar magmas might have occurred 
throughout the region. 


Aqueous processes 


After emplacement of the igneous rocks on 
the crater floor, multiple forms of aqueous in- 
teraction modified—but did not destroy—their 
igneous mineralogy, composition, and texture. 
Evidence for alteration includes the presence 
of carbonate in the Séitah abrasion patches, 
the iron oxides in the Maaz formation abra- 
sion patches (which we presume are due to 
iron mobilization and precipitation), and the 
deposition of salts, including sulfates and per- 
chlorate. More broadly, the appearance of pos- 
sible spheroidal weathering textures (fig. S1C) 
suggests that aqueous alteration played a role 
in rock disintegration (47). 

The similarity of Fe/Mg ratios in the car- 
bonates and olivine within the Dourbes ab- 
rasion patch, rimming of olivine by carbonate, 
and their possible association with likely 
amorphous silicate are all consistent with 
in situ olivine carbonation, in which olivine 
reacts with CO₂-rich water to produce this 
phase assemblage in close physical proximity 
(48-50). Olivine carbonation often occurs together 
with conversion of olivine to serpentine 
group minerals (serpentinization), but in the 
Séitah formation we have found no evidence 
for such minerals. The similarity of olivine and 
carbonate cation compositions, and the sur- 
vival of substantial abundances of olivine, in- 
dicate aqueous alteration under rock-dominated 
conditions, possibly occurring over a short 
duration. Olivine carbonation can occur over 
a wide range of temperatures, from hydro- 
thermal or metamorphic conditions to low- 
temperature weathering (17). The presence 
of carbonate places a lower bound on fluid 
pH that depends on the concentrations of 
cations and dissolved inorganic carbon (51, 52). 
For example, if Mg and Fe concentrations 
ranged from 10⁻⁴ to 10⁻² mol kg⁻¹ and the 
initial dissolved inorganic carbon equilibrated 
with 0.1 to 10 bar CO₂(g), the pH would have a 
lower limit of between 5.5 and 7.5 (at 25°C). 
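
The dependence of this lower pH bound on cation concentration and CO2 pressure can be illustrated with a simple saturation calculation. This is a sketch under stated assumptions, not the calculation performed in the cited work: activities are treated as concentrations, the 25°C equilibrium constants are textbook-order values, the magnesite solubility product (pKsp = 7.8) is an assumed value that varies between literature sources, and the function name and input values are hypothetical.

```python
import math

# Assumed 25 degC equilibrium constants (not taken from the paper).
PK_H = 1.47    # CO2(g) <-> CO2(aq), in mol kg^-1 bar^-1
PK_1 = 6.35    # CO2(aq) + H2O <-> H+ + HCO3-
PK_2 = 10.33   # HCO3- <-> H+ + CO3^2-


def min_ph_for_carbonate(metal_molal: float, pco2_bar: float, pksp: float) -> float:
    """Lowest pH at which a carbonate MCO3 reaches saturation.

    Saturation requires [M2+][CO3^2-] = Ksp, with
    [CO3^2-] = K_H * K_1 * K_2 * pCO2 / [H+]^2.
    Solving for [H+] gives the expression returned below.
    """
    if metal_molal <= 0 or pco2_bar <= 0:
        raise ValueError("concentration and pCO2 must be positive")
    return 0.5 * (PK_H + PK_1 + PK_2 - pksp
                  - math.log10(pco2_bar) - math.log10(metal_molal))


# Hypothetical inputs: 1e-3 mol/kg Mg, 1 bar CO2, assumed magnesite pKsp of 7.8.
print(min_ph_for_carbonate(1e-3, 1.0, 7.8))
```

In this simplified model, each tenfold increase in either the cation concentration or the CO2 pressure lowers the bound by half a pH unit.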

Although we have found no evidence for 
carbonate in the Maaz formation abrasion 
patches, the abundant dark mantling material 
(Fig. 3A) could be a secondary serpentine 
group mineral produced by aqueous altera- 
tion of Fe-rich pyroxene or olivine (53). The 
presence of secondary Fe silicate without high 
abundances of carbonate in Maaz formation 
rocks could indicate a fluid with lower dis- 
solved inorganic carbon concentrations than 
we inferred for the Séitah formation. These Fe 
silicates can also form across a wide range of 
temperatures, from ambient to hydrothermal; 
we cannot constrain the emplacement condi- 
tions without knowing the precise mineral 
composition. Alternatively, the Fe-rich mate- 
rials could represent films of Fe oxides or 
oxyhydroxides coating and penetrating pri- 
mary silicates, such that our elemental mea- 
surements include both phases. On Earth, iron 
staining of rocks is common when reduced 
Fe is released from aqueously altered igneous 
phases and then precipitated under oxidizing 
conditions. For example, sulfide dissolution by 
oxidizing solutions commonly yields hematite 
or goethite, while simultaneously mobilizing 
sulfate (54). 

Sulfate minerals, sodium perchlorate, and 
possibly other salts (e.g., halite) indicate that 
other styles of aqueous activity also occurred 
in the crater-floor igneous formations. These 
salts fill pore spaces, voids, and cracks that 
themselves might have resulted from aqueous 
alteration, for example, from volume expan- 
sion accompanying carbonation and serpen- 
tinization (55), dissolution of phases such as 
sulfides, or by chemical and physical erosion 
processes producing open space (such as the 
pits in Guillaumes). We have not observed 
sulfides, but if they were once present, their 
alteration could have contributed to the pro- 
duction of Ca, Mg, or Fe sulfates or mixtures of 
them in situ. However, the high sulfate concentrations 
observed in Guillaumes and Bellegarde 
suggest an external source for at least some 
of the sulfates. As has been proposed for rocks 
at Gale crater investigated by the Curiosity 
rover (56), perchlorate can be introduced into 
near-surface rocks by downward percolation 
and evaporation of brines, possibly associated 
with ephemeral wetting events extending into 
the Amazonian (<3 Ga). The co-occurrence of 
perchlorate and sulfate minerals indicates 
that at least some of the sulfate could have 
been similarly sourced. The diversity of salt 
compositions in the analyzed white patches 
(e.g., their variety of S/Cl and Ca/Mg/Fe ratios) 
indicates that several different fluids could have 
been involved, possibly at different times and 
possibly repeatedly. The extreme solubility of 
perchlorate salts indicates that they are the 
youngest alteration materials in these rocks. 
Salt hydration and dehydration can cause sub- 
stantial volume expansion (57), weakening the 
rock and contributing to the mechanical ero- 
sion and disaggregation of outcrop by aeolian 
processes across the floor of Jezero crater. 

Although there is evidence for rock-dominated 
aqueous alteration and a diversity of second- 
ary phases, there is no geochemical or miner- 
alogical indication of extensive open-system 
aqueous alteration, in which soluble cations 
(Ca, Na, K) are entirely removed, resulting in 
enrichments in aluminous phases (i.e., Al-rich 
clays) (58). The bulk compositions of both the 
Maaz and Séitah formations are consistent 
with near-pristine igneous rocks (except in S 
and Cl; see tables S1 and S2); hydrated aluminosilicates 
are rare or absent; and there is 
no textural or chemical evidence for extensive 
plagioclase alteration or olivine and pyroxene 
dissolution (59). 


Geologic models 


We propose two scenarios for the history of 
the Jezero crater floor, given our inability to 
distinguish whether the Maaz formation con- 
sists of lava flows (hereafter model 1) or an 
igneous complement to the olivine cumulate 
of the Séitah formation (model 2). 

During or after the formation of Jezero 
crater, a thick magma body or melt sheet (35) 
existed within the crater. This could have been 
emplaced on, or intruded into, a preexisting 
crater fill. Differentiation and crystal settling 
formed the olivine cumulate (35) while also 
producing a less-mafic complement (Fig. 6A). 
In model 1, erosion removed most (or all) of 
this less-mafic material, leaving at least tens of 
meters of topography on the residual olivine 
cumulate deposit (Fig. 6B), after which lava 
flows of the Maaz formation flowed around, 
and at least partially overtopped, the under- 
lying topography (Fig. 6C). Deformation of un- 
known origin then tilted the originally flat-lying 
layers of both formations, at least locally, by 
~10° to the southwest (Fig. 6D), as observed 
along Artuby ridge. In model 2, both Maaz 
and Séitah were produced by magmatic differ- 
entiation similar to a layered igneous body 
(Fig. 6E). Deformation of unknown origin then 
uplifted Séitah relative to Maaz, tilting both 
(Fig. 6F). 

In both models, the Maaz and Séitah for- 
mations later experienced further erosion that 
removed overlying Maaz formation rocks from 
above the Séitah formation (Fig. 6G). We adopt 
the interpretation (2/, 28) that the crater-floor 
units were then at least partially buried by the 
delta and associated Jezero lake deposits (Fig. 
6H). The late Noachian age (3.6 to 3.8 Ga) (7) 
of the largest river valley networks feeding 
Jezero and the assumed Hesperian age of the 
exposed delta (8) provide a possibly long period 
of aqueous activity during which lacustrine 
and igneous processes both occurred, possi- 
bly driving hydrothermal interactions. How- 
ever, we have found no evidence for remnant 
sedimentary units along Perseverance’s tra- 
verse up to sol 290. If delta or lake sediments 
once covered both the Maaz and Séitah for- 
mations (Fig. 6H), they appear to have largely 
eroded away (Fig. 6I), except for delta remnant 
mounds such as Kodiak (60). 

A variety of concurrent and subsequent pro- 
cesses, including rock-dominated alteration and 
salt deposition, modified the chemistry and 
mineralogy of these rocks as they interacted 
with aqueous solutions. Spatially and chem- 
ically distinct materials (carbonate, likely amor- 
phous silicate, Fe oxides, perchlorate, sulfates, 
and possibly Fe silicates) could be the product 
of multiple distinct aqueous events, differing 
in chemistry and possibly separated in time. 
Both hydrothermal waters and groundwater 
could have played a role. 


Habitability and preservation potential 


One proposed definition of a habitable envi- 
ronment is one in which liquid water, available 
energy, nutrients, and other physicochemical 
conditions are adequate for supporting known 
forms of life (61). On Earth, reactions of igneous 
rocks with water produce diverse habitats 
for microbial life (62, 63). The oxidation 
and partitioning of iron into secondary phases 
during alteration processes, such as carbonation 
and serpentinization, can yield H₂, a potential 
energy source, along with methane 
and other hydrocarbons, which can form raw 
materials for the synthesis of more-complex 
organic molecules (62, 63). The chemical re- 
duction of nitrate, sulfate, and metal ions, 
along with hydrocarbon degradation, is 
thought to power microbial metabolisms in 
Earth’s deep subsurface (64). Rocks of the 
Jezero crater floor appear to share the com- 
positional components of these habitable en- 
vironments. The soluble salts and vein-filling 
minerals we observe, such as iron oxides, per- 
chlorate, and sulfate salts, could have formed 
under habitable low-temperature conditions 
but could also have formed under nonhabit- 
able conditions (65-67). 

The potential for secondary aqueous phases 
to preserve evidence of life depends on how 
they formed. On Earth, studies of evaporitic 
and desert environments demonstrate the 
presence and preservation of microbial life in 
association with salts (65, 66, 68, 69). Car- 
bonate veins on Earth, produced through 
serpentinization and carbonation, can preserve 
biomarkers such as lipids (70-72) and/or 
organic structures interpreted as fossilized mi- 
crobial communities (77). By analogy, we pro- 
pose that the secondary phases in the altered 
igneous rocks of the Jezero crater floor might 
have the potential to record biosignatures, if 
ancient habitable environments existed. 


Sample collection 


Laboratories on Earth have measurement ca- 
pabilities superior to instruments that can 

Fig. 6. Two possible models for the evolution of the Jezero crater floor. Proposed geological formations 
are indicated by the legend. (A) In model 1, the differentiated magma body experienced (B) erosion that 
removed a less-mafic complement, producing topography on the Séitah formation olivine cumulate. (C) Maaz 
formation lavas surrounded this underlying topography. (D) Deformation of unknown cause tilted both 
Séitah and Maaz, as observed along Artuby ridge. (E) In model 2, flat-lying layers of a differentiated magma 
body (F) deformed in a way that tilted both the olivine cumulate (Séitah formation) and the less-mafic 
complement (Maaz formation) and also uplifted Séitah relative to Maaz. In (G) and (H), common to both 
models, erosion occurred and delta and lacustrine sediments were deposited in the lake that filled Jezero. 
(I) Later, these rock units were eroded to their current surface exposure distribution. Aqueous alteration 
processes could have occurred during any of these steps. 
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25cm 


Fig. 7. Core samples collected for potential return to Earth. (A) Maaz formation 
Rochette member outcrop. The white arrow indicates the sampled rock, part of an 
apparent degrading layer on the crest of Artuby ridge. Enhanced-color Mastcam-Z 
mosaic acquired on sol 180. (B) Close-up of the sampled rock, showing the core 
hole. The Bellegarde abrasion patch is located directly below the hole, partly buried 
by tailings. Perseverance Navigation Camera (Navcam) (75) image acquired on 

sol 190, white-balanced with slight color saturation applied. (©) White-balanced Cache 
Camera (CacheCam) (75) image of the bottom of the core named Montdenier 


be mounted on a Mars rover. An objective of 
the Mars 2020 mission is therefore to collect 
samples, in the form of drilled rock cores, 
for potential transport to Earth by another 
spacecraft in the 2030s (supplementary text). 
We attempted core acquisition on three differ- 
ent rock targets: two in the Maaz formation 
and one in the Séitah formation. The first 
coring attempt occurred near Guillaumes 
(Fig. 1B), in the low-lying, polygonal pavement 
of the Roubion member of the Maaz forma- 
tion. Although the sample collection process 
appeared to execute as expected, the sample 
tube was subsequently found to contain no 
rock or rock fragments. A likely explanation 
is that this rock disintegrated during drilling, 
probably as a result of the rock’s aqueous al- 
teration (Fig. 3A). The sample tube was sealed 
and instead provides a sample of ambient 
martian atmosphere. A second sample acquisi- 
tion attempt was performed on the Rochette 
member on the crest of Artuby ridge near 
Bellegarde, and a third on the Séitah formation 
Bastide member near Dourbes; these were 
both successful (Fig. 7). 

Aqueously altered igneous rocks have ad- 
vantages as samples for return to Earth. The 
formations these samples were taken from 
likely date from the late Noachian to early 
Hesperian; only a small fraction of martian 
meteorites are this old (73). Their crystalliza- 
tion ages, along with the pressures and tem- 
peratures during igneous rock formation, could 
potentially be determined with current laboratory
equipment. Because both the Maaz and
Séitah formations are thought to extend beneath
the delta in Jezero (21, 28), igneous crystallization
ages would provide upper bounds
on the age of the lake and the fluvial processes
that deposited the delta (60), a lower bound 
on the age of the Jezero impact, and an upper 
bound on the age of the smaller craters on 
the Maaz formation. The igneous lithologies of 
the samples might be suitable for paleomag- 
netic analysis of the ancient martian magnetic 
field (74).

The geological units on the floor of Jezero crater, Mars, are part of a wider regional stratigraphy of
olivine-rich rocks, which extends well beyond the crater. We investigated the petrology of olivine-
and carbonate-bearing rocks of the Séitah formation in the floor of Jezero. Using multispectral images
and x-ray fluorescence data, acquired by the Perseverance rover, we performed a petrographic
analysis of the Bastide and Brac outcrops within this unit. We found that these outcrops are composed
of igneous rock, moderately altered by aqueous fluid. The igneous rocks are mainly made of coarse-grained
olivine, similar to some martian meteorites. We interpret them as an olivine cumulate, formed
by settling and enrichment of olivine through multistage cooling of a thick magma body.


Jezero crater, the landing site of the Perseverance
rover, is located at the western
edge of Isidis Planitia on Mars (Fig. 1A).
Orbital data have constrained the crater’s
age to between 3.82 billion and
3.96 billion years (1, 2). A light-toned, olivine-
and carbonate-bearing rock unit exposed on
the floor of Jezero has been informally named
the Séitah formation (Fig. 1C) (3-5). Rocks
similar to Séitah in morphology and mineralogy
extend beyond the rim of Jezero; they
span >70,000 km² from inside Isidis Planitia
(6), to the neighboring regions of Northeast
Syrtis (7), Nili Fossae (8), and Libya Montes
(Fig. 1B) (9, 10). Orbital observations have
shown that this regional unit has high abundances
of olivine (11), an early crystallizing,
rock-forming mineral, with large inferred
grain sizes (several millimeters) and moderate
magnesium contents of Fo₄₄–₆₆ [where Fo is
forsterite (Mg₂SiO₄) and the subscript numbers
indicate its molar percentage in olivine]
(4, 11). Hypotheses for the unit’s origin include
an impact melt or intrusive igneous rocks (6),
basaltic lava flows (9, 12), volcanic ash (2, 13), or
clastic material (fragmental deposits formed
by transport and deposition from volcanic
eruptions, impacts, water, or wind) (14).
Images of Séitah formation rocks taken by 
the Perseverance rover have shown centimeter- 
to decimeter-scale layering and abundant dark, 
millimeter-scale angular grains, reported in a 
companion paper (5). These observations were 
consistent with any of the above hypotheses 
for the regional unit’s origin. We sought to 
distinguish between these possibilities using 
the rover’s arm-mounted Planetary Instru- 
ment for X-ray Lithochemistry (PIXL) (15). 


PIXL observations 


To remove any modified surface materials, 
two 50-mm-diameter patches (informally 
named Garde and Dourbes) were abraded into 
two rock outcrops (informally named Bastide 
and Brac, respectively). Both outcrops are part 
of the Bastide member (informal name) of the 
Séitah formation (Fig. 1C). Reflectance optical 
images of both patches—illuminated with ultra- 
violet (UV; 385 nm), blue (B; 450 nm), green 
(G; 530 nm), and near-infrared (NIR; 735 nm) 
light-emitting diodes (LEDs)—were acquired by 
using PIXL’s micro context camera (MCC). The 
Dourbes patch was subsequently analyzed to 
produce two x-ray fluorescence (XRF) raster 
maps with areas of 4 by 12.5 mm and 5 by 7 mm,
at a step size of 0.125 mm (Fig. 2) (16). 
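
As a rough sense of scale for these scans, the sketch below counts the raster steps implied by the stated map dimensions and step size. It assumes a regular rectangular grid with one measurement per step; whether both edges of each map are sampled is not stated here, so the point counts are approximate, and the function name `raster_steps` is hypothetical.

```python
def raster_steps(width_mm, height_mm, step_mm):
    """Number of step intervals along each axis of a regular raster grid."""
    return round(width_mm / step_mm), round(height_mm / step_mm)


# Map dimensions and step size as stated in the text.
for width, height in [(4.0, 12.5), (5.0, 7.0)]:
    nx, ny = raster_steps(width, height, 0.125)
    print(f"{width} x {height} mm map: {nx} x {ny} steps, about {nx * ny} points")
```

Under these assumptions each map contains a few thousand analysis points, which is what makes mineral-by-mineral chemistry within a single abraded patch possible.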

We used these data to produce color and 
color ratio images of both patches and esti- 
mated volume percentages of phases in each 
(Fig. 2A, fig. S1, and table S1). For the Dourbes 
patch where XRF maps were acquired, we pro- 
duced diffraction maps of crystallinity and crys- 
tal orientation (Fig. 2C and fig. S2), composite 
maps of elemental abundances (Fig. 2D and 
figs. S3 and S4), and chemical analyses of individual
minerals and the bulk rock (tables S2
to S5) (6). The reconstructed color images
show that both patches display a similar texture,
with beige equant grains of sizes ranging
from 1 to 3.5 mm, greenish-gray irregularly
shaped grains, and reddish-brown materials
rimming these grains or occupying irregular 
areas between them (fig. S1, A and E). These 
features are discernable in color ratio plots 
(Fig. 2A and fig. S1). Elemental maps show 
intergrain areas that contain distinct chemical 
variations, and additional irregular regions 
with high aluminum (Al) (Fig. 2D) and non- 
uniform distribution of sodium (Na), potas- 
sium (K), and phosphorus (P) (figs. S3 and S4). 

Our identifications of olivine and pyroxene 
minerals are supported by stoichiometric anal- 
ysis of the PIXL data and are consistent with 
observations by other Perseverance instruments
(5). We found that the beige equant
grains have molar (Mg+Fe)/Si = 2 and total
cations of 2.995 to 3.025 per four oxygens
(table S2), which is consistent with olivine’s
chemical formula, (Fe,Mg)₂SiO₄. The greenish-gray
grains have molar (Ca+Mg+Fe)/Si ~ 1 and
total cations of 4.032 to 4.042 per six oxygens
(table S3), which is consistent with pyroxene’s
formula, (Ca,Fe,Mg)₂Si₂O₆. The molar Ca/
(Ca+Mg+Fe) ratios of the pyroxene (0.35 to
0.37) show that it is the mineral augite (Fig. 3A)
(17). Back-reflected diffracted x-rays from some
noncontiguous augite regions show that they 
have identical crystal orientations, indicating 
that these regions are likely to be part of the 
same three-dimensional grain (a single augite 
crystal) (Fig. 2C and fig. S2). This texture, in
which a large single pyroxene grain encloses 
smaller olivine grains (poikilitic texture), is 
characteristic of igneous cumulate rocks (Fig. 
2B). The irregular intergrain Al-rich areas 
contain intergrown grains of Na-rich feldspar 
(<1 mm), Fe-Cr-Ti oxide (<0.25 mm), K-rich 
feldspar (<0.125 mm), and Ca-phosphate 
(<0.125 mm), which are typical features of 
mesostasis—the last formed materials in the 
intergranular space—in igneous rocks (figs. S3 
and S4). Olivine, augite, and mesostasis are 
the primary minerals in the prealteration bed- 
rocks in the Séitah formation. The intergrain 
reddish-brown materials contain silicates, 
Fe-Mg carbonate, and Fe-Mg-Ca sulfate, which 
are secondary materials from alteration. The 
presence of carbonates was also indicated by
other instruments on Perseverance (5) and is
consistent with orbital observations. We focused
our analysis on the primary minerals.
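
The stoichiometric test described above (cations per fixed number of oxygens) can be illustrated with a minimal sketch. It normalizes an oxide analysis in wt % to a chosen oxygen basis and checks the totals against the ideal olivine formula. The input composition is hypothetical (it is not taken from table S2), all iron is assumed to be FeO, and the helper name `cations_per_oxygens` is an illustration rather than the procedure used by the authors.

```python
# Molar mass (g/mol), cations per formula unit, oxygens per formula unit.
OXIDES = {
    "SiO2": (60.084, 1, 2),
    "FeO": (71.844, 1, 1),
    "MgO": (40.304, 1, 1),
    "CaO": (56.077, 1, 1),
}


def cations_per_oxygens(wt_percent, n_oxygens):
    """Normalize an oxide analysis (wt %) to cations per n_oxygens oxygens."""
    moles = {ox: wt / OXIDES[ox][0] for ox, wt in wt_percent.items()}
    oxygen_total = sum(m * OXIDES[ox][2] for ox, m in moles.items())
    scale = n_oxygens / oxygen_total
    return {ox: m * OXIDES[ox][1] * scale for ox, m in moles.items()}


# Hypothetical olivine analysis (not from table S2), all Fe taken as FeO.
olivine_wt = {"SiO2": 35.54, "FeO": 38.24, "MgO": 26.22}
olivine = cations_per_oxygens(olivine_wt, n_oxygens=4)

total_cations = sum(olivine.values())
mg_fe_over_si = (olivine["MgO"] + olivine["FeO"]) / olivine["SiO2"]
print(f"total cations per 4 O: {total_cations:.3f}")
print(f"(Mg+Fe)/Si: {mg_fe_over_si:.3f}")
```

For an ideal olivine the total is 3 cations per four oxygens with (Mg+Fe)/Si = 2; calling the same function with `n_oxygens=6` gives the pyroxene basis, where the ideal total is 4.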

Olivine is the dominant primary mineral,
making up 65 ± 5 vol % of the Dourbes and
Garde rocks (fig. S1 and table S1). The remainder
of the analyzed area is made of augite
at ~13 vol %, mesostasis at ~10 vol %, and secondary
materials at ~12 vol %. The relative
mineral proportions of olivine, augite, and
feldspar (Fig. 3C) indicate that Dourbes is an
olivine-rich wehrlite, a type of ultramafic igneous
rock (18).
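
To show how modal proportions map to a rock name, here is a minimal sketch of the olivine-orthopyroxene-clinopyroxene classification commonly used for ultramafic rocks. The field boundaries (90% olivine for dunite, 40% olivine for the peridotite field, 5% for the minor pyroxene) follow the widely used IUGS scheme as recalled here and are simplified; the pyroxenite field is not subdivided. Treating augite as the only pyroxene, setting orthopyroxene to zero, and excluding mesostasis and secondary materials from the normalization are assumptions made for this illustration.

```python
def classify_ultramafic(olivine, opx, cpx):
    """Simplified Ol-Opx-Cpx classification from modal volume percentages."""
    total = olivine + opx + cpx
    if total <= 0:
        raise ValueError("at least one mode must be positive")
    # Renormalize the three modes to 100%.
    ol, op, cp = (100.0 * v / total for v in (olivine, opx, cpx))
    if ol >= 90:
        return "dunite"
    if ol >= 40:  # peridotite field
        if cp < 5:
            return "harzburgite"
        if op < 5:
            return "wehrlite"
        return "lherzolite"
    return "pyroxenite"  # subdivisions of this field are omitted here


# Modes from the text: ~65 vol % olivine, ~13 vol % augite (a clinopyroxene).
rock = classify_ultramafic(olivine=65, opx=0, cpx=13)
print(rock)
```

With these inputs the renormalized olivine mode is about 83%, which places the rock well inside the olivine-rich part of the wehrlite field.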

The chemical compositions of the olivine
grains (Fo₅₅ ± 1) are uniform, both within and
between grains (Fig. 3B and table S2). Olivine
grains contain more Fe than would be in
equilibrium with a melt of the rock’s bulk composition
(fig. S5 and tables S2 and S5). The
augite composition is Wo₃₅–₃₇En₄₃–₄₄Fs₂₀–₂₁,
expressed as molar percentages of wollastonite
(Wo; Ca₂Si₂O₆), enstatite (En; Mg₂Si₂O₆), and
ferrosilite (Fs; Fe₂Si₂O₆) (Fig. 3A). Some regions
in the single large poikilitic augite grain
we investigated have small variations in Ca
(<0.7 wt % Ca), Fe (<0.6 wt %), Mg (<0.6 wt %),
and Cl (~0.3 wt %) (table S3). These could indicate
the presence of inclusions, thin lamellae
of pyroxene of different compositions, and/or
minor alteration products that are smaller than
PIXL’s x-ray beam size.

Fig. 1. The Dourbes patch in context from outcrop to regional scale. (A) Colorized terrain topography
map showing location of Jezero crater on west edge of Isidis Planitia. Color scale bar shows altitude
in meters. (B) Distribution of the regional olivine-bearing unit (green) on an orbital image. Data are from
(2). (C) Color orbital image showing the location of Brac and the Bastide outcrops in the Séitah formation.
The dotted white line demarcates the boundary of the Séitah formation outcrops. (D) Mast Camera Zoom
(Mastcam-Z) image of the Brac outcrop, showing 5- to 10-cm-thick layers and the location of Dourbes patch
(5 cm diameter). (E) Mastcam-Z image showing more detail of the Dourbes patch. (F) A portion of the
abraded patch shown by the autofocus and context imager (ACI) of the Scanning Habitable Environments
with Raman and Luminescence for Organics and Chemicals (SHERLOC) instrument on Perseverance rover
(5). Image sources are listed in table S7.
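
The wollastonite-enstatite-ferrosilite notation used for the augite composition is a renormalization of the molar Ca, Mg, and Fe contents to 100%. The sketch below shows that arithmetic. The cation values are hypothetical, chosen only to fall near the reported range, and the augite check (Wo between 20 and 45) reflects the commonly used pyroxene nomenclature as recalled here, stated as an assumption rather than as the exact criterion of (17).

```python
def pyroxene_end_members(ca, mg, fe):
    """Return (Wo, En, Fs) in mol % from molar Ca, Mg, and Fe."""
    total = ca + mg + fe
    return tuple(100.0 * x / total for x in (ca, mg, fe))


# Hypothetical cations per six oxygens (not from table S3).
wo, en, fs = pyroxene_end_members(ca=0.72, mg=0.87, fe=0.41)
print(f"Wo{wo:.1f} En{en:.1f} Fs{fs:.1f}")

# Assumed nomenclature rule for the Ca-Mg-Fe pyroxenes: augite has 20 <= Wo <= 45.
is_augite = 20.0 <= wo <= 45.0
print("augite" if is_augite else "not augite")
```

The Wo value is the same quantity as the molar Ca/(Ca+Mg+Fe) ratio quoted in the text, expressed as a percentage.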

Fe/Mn ratios of olivine and pyroxene in meteorites
have been used to determine their
planetary parentage (19). The molar Fe/Mn
ratios in martian meteorites are 47 ± 3 for
olivine and 32 ± 6 for pyroxene (uncertainties
are 1 standard deviation) (19). The Dourbes
minerals have ranges of molar Fe/Mn ratios
of 55 ± 8 to 67 ± 9 for olivine and 38 ± 13 to
42 ± 3 for augite. The lower ends of these
ranges are consistent with the values found
for meteorites; we speculate that the higher
Fe/Mn values could be due to a small amount
of Fe-bearing minerals formed from aqueous
alteration of olivine and augite.
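
The comparison above can be made explicit by treating each quoted value as an interval of the mean plus or minus its uncertainty and testing for overlap. This is only a sketch: reading the Dourbes uncertainties as one-standard-deviation intervals is an assumption (the text states this only for the meteorite values), and interval overlap is a crude stand-in for a proper statistical test.

```python
def interval(mean, sd):
    """Closed interval [mean - sd, mean + sd]."""
    return (mean - sd, mean + sd)


def overlaps(a, b):
    """True if two closed intervals share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Molar Fe/Mn values quoted in the text.
METEORITE = {"olivine": interval(47, 3), "pyroxene": interval(32, 6)}
DOURBES = {
    "olivine": [interval(55, 8), interval(67, 9)],    # low end, high end
    "pyroxene": [interval(38, 13), interval(42, 3)],  # low end, high end
}

results = {
    mineral: [overlaps(r, METEORITE[mineral]) for r in ranges]
    for mineral, ranges in DOURBES.items()
}
for mineral, flags in results.items():
    print(mineral, "low end overlaps:", flags[0], "| high end overlaps:", flags[1])
```

Under these assumptions only the lower-end values overlap the meteorite ranges, which matches the statement in the text.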

Mesostasis areas show few signs of secondary
salts; they have low abundances of SO₃
and Cl, and all analyzed element abundances
sum to ~94.0 to 99.5 wt % (table S4), indicating
the presence of little to no additional
material enriched in C or H, which PIXL cannot
detect directly. Compositions of the Fe-
Cr-Ti oxide grains in Dourbes vary, depending 
on their petrographic setting. Those embedded 
in olivine have high molar Cr/Ti (~10), those 
at olivine-pyroxene boundaries have moderate 
Cr/Ti (~3.6), and those in mesostasis have low 
Cr/Ti (1 to 1.9) (fig. S6), leading to the varia- 
tions in their estimated chemistry (table S4). 
Calcium phosphate occurs only in the meso- 
stasis and has molar Ca/P ratios most con- 
sistent with merrillite (9:7) (fig. S7). 
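
A molar Ca/P ratio can discriminate between calcium phosphates because their ideal formulas fix that ratio. The sketch below compares a measured ratio with two ideal values. Merrillite at 9:7 is from the text; apatite, with an ideal Ca/P of 5:3, is added here only as a familiar point of comparison and is not discussed in the source. The measured value of 1.30 is hypothetical.

```python
# Ideal molar Ca/P ratios. Apatite is an added comparison, not from the text.
IDEAL_CA_P = {
    "merrillite": 9 / 7,  # Ca9NaMg(PO4)7
    "apatite": 5 / 3,     # Ca5(PO4)3(F,Cl,OH)
}


def closest_phosphate(ca_p_ratio):
    """Name of the phase whose ideal Ca/P is nearest the measured ratio."""
    return min(IDEAL_CA_P, key=lambda name: abs(IDEAL_CA_P[name] - ca_p_ratio))


measured = 1.30  # hypothetical value for illustration
print(closest_phosphate(measured))
```

Because the two ideal ratios differ by roughly 0.4, even a moderately uncertain Ca/P measurement can separate them.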


Interpretation as an igneous cumulate 


We used our observations to evaluate the hypotheses
mentioned above for the origin of the
olivine-carbonate unit exposed in Séitah.

Volcaniclastic (for example, pyroclastic de- 
posits) or impact clastic (for example, rock 
fragments cemented by quenched impact melt) 
origins typically produce heterogeneous mix- 
tures of mineral grains and rock fragments, 
embedded in a fine-grained matrix of (or al- 
tered from) ash particles (<50 um). For Brac 
and Bastide outcrops in the Séitah forma- 
tion, there is no evidence of rock fragments 
or a zoned fine-grained texture (Fig. 1). Ob- 
servations of other outcrops also reveal no 
evidence of heterogeneous mixtures (5). We 
therefore exclude this possibility. 

Sedimentary deposition of detrital olivine 
grains with cements is unlikely. The observed 
assemblage of coarse-grain augite and meso- 
stasis with feldspar and Fe-Cr-Ti oxides has 
not previously been observed as a sedimentary 
cement and would not precipitate from fluids 
at near-surface temperatures and pressures. 
These grains are also too coarsely crystalline 
to be a detrital matrix. It is also not likely that 
the entire abrasion patch sits within one rock 
fragment in an extremely coarse-grained water- 
transported clastic rock (a conglomerate) be- 
cause there is no evidence in images of the 
host rock for pebble-sized or larger grain 
boundaries, cements or fine-grained matrix, 
hydraulic sorting, sedimentary structures, ero- 
sion, or channel forms, as would be expected 
for such a deposit (5). 

Instead, we conclude that the olivine and
augite formed by cooling of a magma body.
The magma could have arisen from volcanism
or an impact because impact melting can
fully homogenize the target rock, which would
then solidify and crystallize very similarly to a
magma originating from the martian mantle.
We interpret Brac to be a cumulate igneous
rock formed by settling and accumulation of
olivine crystals from a molten mafic magma.
The observed poikilitic texture is common in
cumulate rocks on Earth (20) and in several
martian meteorites (21). The compositional
trend in Cr/Ti ratios we observed in the Fe-Cr-Ti
oxide grains is also consistent with oxides
crystallizing from a cooling magma (fig. S6).
The similarity in mineralogy (5), textures, color,
and reflectance spectral properties between
Brac and Bastide outcrops within the Séitah
formation indicates that our cumulate lithology
interpretation can be extrapolated beyond
the Dourbes abraded patch.

Fig. 2. Cumulate texture of Dourbes. (A) Ratio image taken with the PIXL MCC, showing the G/NIR ratios
(color bar). Dourbes has a cumulate texture with olivine grains (Ol) and a poikilitic texture (region
within the white dotted line) with pyroxene (Px). Secondary minerals are labeled “2ndry.” White boxes
indicate the areas used for x-ray fluorescence scans (figs. S3 and S4), the larger of which is shown in
(C) and (D). (B) Cross-polarized light image of the Chassigny meteorite for comparison, showing poikilitic
texture. Labels are the same as in (A). The white circle indicates a melt inclusion (M.I.) in Ol.
(C) Mineral domain clustering map of the same region shown in (D). Green and magenta are used for
two different diffraction peaks (7.28 to 7.72 keV and 5.67 to 6.17 keV, respectively). Points with the same
diffraction peak are shown in the same color. All three regions of Px chemistry display the same diffraction
peak (all green), suggesting that these regions are from a single pyroxene crystal. Ol grains inside the
Px grains display different orientations, although all points inside one grain display the same diffraction peak
(magenta) (16). (D) Chemical map derived from x-ray fluorescence, showing FeO (red), MgO (green), and
Al₂O₃ (blue) concentrations (grayscale images of each element are provided in fig. S3). Labels indicate
olivine, pyroxenes, and mesostasis (Meso). White circles indicate M.I.s. This composition map is shown in
context with other optical images in movie S1.


Petrogenetic history

We next considered the magmatic crystallization
and cooling history of the olivine cumulate
outcrops in the Séitah formation. Using
the chemistry of the olivine (fig. S5), we inferred
that the magma was more iron rich,
with Mg# = 27 to 30 [where Mg# is the molar
Mg/(Mg+Fe) × 100] (16), than previously found
(Mg# = 52 to 67) for martian meteorites (22, 23).
We suggest that Séitah cumulate rocks formed
from a parent magma that had previously lost
Mg-rich minerals. Brac is different in both mineralogy
and bulk chemistry from olivine-rich
cumulates in martian meteorites, and also from
olivine-rich rocks at Gusev crater investigated
by the Spirit rover (Figs. 3 and 4A) (6). The
olivine grain size distribution and uniform composition
in Brac, as well as the lack of larger
olivine crystals observed in any Séitah outcrops,
indicate an initially crystal-free melt (Fig. 5B).
The presence of similarly sized olivine grains
indicates an early stage of rapid homogeneous
olivine nucleation, due to oversaturation,
possibly induced by rapid cooling (Fig. 5C).
Additionally, the presence of melt inclusions
(typically 100 to 300 µm, with one up to
~600 µm) in the Dourbes olivine grains (Fig. 2D
and fig. S4) suggests early rapid growth of
olivine, forming crystals with open space between
their outside faces (hopper or skeletal
shaped crystals), a typical morphology under
rapid cooling (24). Later, olivine crystals with
euhedral shapes formed through additional
olivine growth filling in the open space, before
or during gravitational settling (Fig. 5C). The
gravitational settling forms a framework of
olivine grains with mostly point contacts and
open space between grains (Fig. 5D). There is
little textural or chemical evidence for later
olivine growth filling in the open space between
olivine grains, indicating that the cumulate
was unaffected by subsequent magma
infiltration or exchange after the olivine crystals
had formed and accumulated. The proportion
(~35 vol %) of other minerals among the olivine
is consistent with the expected porosity of an
olivine cumulate unaffected by compaction (25).
The chemical equilibrium of olivine and augite
and the coarse grains of augite imply extended
cooling at a temperature (~1085°C) below that
at which olivine starts to crystallize (16).
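
One common way to infer a melt Mg# from an olivine composition, not necessarily the procedure used in (16), is the Fe-Mg exchange coefficient between olivine and melt, K_D = (Fe/Mg)olivine / (Fe/Mg)melt. The sketch below applies it. The K_D range of 0.30 to 0.35 is an assumed, commonly quoted range, the olivine composition of Fo55 is an assumed input for illustration, and all iron is treated as ferrous.

```python
def melt_mg_number(fo_percent, kd):
    """Melt Mg# (molar Mg/(Mg+Fe) x 100) in Fe-Mg exchange equilibrium with olivine.

    fo_percent: olivine forsterite content in mol %.
    kd: assumed exchange coefficient (Fe/Mg)olivine / (Fe/Mg)melt.
    """
    fe_mg_olivine = (100.0 - fo_percent) / fo_percent
    fe_mg_melt = fe_mg_olivine / kd
    return 100.0 / (1.0 + fe_mg_melt)


# Assumed inputs: Fo55 olivine and a K_D range of 0.30 to 0.35.
low = melt_mg_number(55.0, kd=0.30)
high = melt_mg_number(55.0, kd=0.35)
print(f"melt Mg# between {low:.1f} and {high:.1f}")
```

Because K_D is well below 1, olivine is always more magnesian than the melt it crystallizes from, so an iron-rich olivine implies a still more iron-rich parent magma.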


Emplacement history 


Olivine cumulates can form in different geological
settings, including large layered mafic
intrusions (LLMIs), thick lava flows and ponds,
shallow intrusive bodies such as sills or laccoliths
(lens-shaped igneous formations), or thick melt
sheets from large impacts or their melt ejecta
(Fig. 5A). The emplacement environment of
cumulate rocks as observed in the Séitah formation
can be assessed by using Earth analogs,
after considering the difference in gravity
(6). Unlike similar rocks in Earth LLMIs (20),
Séitah olivine grains host large melt inclusions
and show negligible additional growth
after the olivine framework formed, indicating
that Brac formed at depths of no more than a
few kilometers. Thick (100 to 150 m) mafic or
ultramafic lava flows or ponds on Earth form
uncompacted frameworks of euhedral to subhedral
olivine, similar to those observed in
the Séitah formation. However, among Earth
examples, olivine grains are typically smaller
and the mesostasis is more fine-grained (or
glassy) than in Séitah rocks (16). This indicates
that these martian cumulates experienced
later cooling at a lower rate than those of
Earth analogs. We suggest potential explanations
of a thicker lava than on Earth, or emplacement
into hot surroundings (such as a
crystal-laden lava injecting into a still-hot
flow), or an intrusive environment such as a
sill or a laccolith. Alternatively, similar features
could have formed in a superheated impact
melt sheet.

Fig. 3. Mineralogy of Dourbes. (A) Classification of pyroxene by using the percentage compositions of
wollastonite (Wo), enstatite (En), and ferrosilite (Fs) (17). The average compositions of Dourbes pyroxene
grains in table S3 are indicated with star symbols. Symbols or fields with solid lines indicate volumetrically
major pyroxene species, and those with dotted lines are minor pyroxene species. Arrows indicate the
direction of crystallization. (B) Binary diagram showing mole percentage of forsterite (Fo) in the olivine (Ol).
The average compositions of Dourbes olivines (table S2) are indicated with star symbols. Dourbes olivine
and pyroxene differ from martian meteorites. (C) Classification of ultramafic rocks by using volume
percentages of olivine (Ol), clinopyroxene (Cpx), and orthopyroxene (Opx) (18). Symbols are defined in the
legend. Source data for meteorites are listed in table S7.

In those environments, olivine cumulates
represent only the lower portion of the petrogenetic
sequence. On Earth, pyroxene-rich
cumulates commonly occur stratigraphically
above olivine-rich layers, with a trend of minerals
increasing in Fe, Na, and K higher in the
sequence (Fig. 5D). Similar petrogenetic sequences
have been hypothesized for cumulate
martian meteorites (26-28). The overlying
Maaz formation could consist of such rocks
(5); however, the contact between the Séitah
and Maaz formations is obscured along the
rover traverse, preventing a definitive determination
of their relationship (5). Alternatively,
the originally overlying strata might
not have been explored by the rover or could
have been eroded, or there is no petrographic
link between the Séitah and Maaz formations.

Fig. 4. Bulk chemistry of Dourbes compared with meteorites and
igneous rocks elsewhere on Mars. (A) Plot of total alkali oxides as a
function of SiO₂ abundance. Fields defined by vertical gray lines indicate
the classification of volcanic rocks (37). The gray field indicates igneous
clasts in breccia meteorite NWA 7034 (38). Dourbes contains less SiO₂
and higher alkali oxides than other martian igneous rocks. (B) Weight
percentage ratios of K₂O/TiO₂ for Dourbes and other martian igneous rocks.
[...] and the Algonquin rock in Gusev crater. Dourbes data are plotted as star
symbols, with the error bar indicating the estimated 1σ uncertainty. Dourbes
data are listed in table S5, and data sources for meteorites and rover data
are listed in table S7.

Fig. 5. Schematic diagram of potential emplacement models. (A) Potential
emplacement models for the Bastide member, which hosts the Brac and
Bastide outcrops. Color hue schematically indicates the temperature of the melt,
with darker being cooler. Possible scenarios are an extrusive lava flow or pond,
an intrusive sill or a small laccolith, or an impact melt sheet or ejecta melt
ponds. All models provide the range of cooling rates as we infer for the Brac
outcrop. (B to D) The petrology of our model (not to scale). (B) Initial
emplacement of melt without any crystals, forming a rapidly quenched top
crust and bottom. (C) Rapid cooling after emplacement causes nucleation and
growth of hopper or skeletal olivine grains near the top surface. Convection
of olivine into the (more slowly cooling) interior promotes olivine growth.
Once each crystal reaches a critical size, it sinks to the bottom. (D) Continued
cooling, crystallization, and sinking concentrates olivine at the base. Further
crystallization of low-Ca pyroxene, high-Ca pyroxene, and other minerals
(indicated in the legend) forms stratified layers, with increasing Fe contents
higher in the melt body.
[Fig. 5 panel labels: "1: Initial emplacement"; "4: Accumulation and crystallization of interstitial phases". Legend entries: olivine, Fe-Ti-Cr oxides, high-Ca pyroxene, low-Ca pyroxene, feldspar, mesostasis; arrow labeled "increasing Fe".]


Regional and planetary context 


The presence of olivine cumulates in Jezero 
crater constrains the origin of the more wide- 
spread olivine-bearing regional units in and 
around Isidis Planitia. On petrologic evidence 
alone, it is most plausible that emplacement 
of the Séitah olivine cumulate was through 
magma injection into an environment that per- 
mitted slow cooling, such as a shallow igneous 
intrusion in the martian crust or into existing 
lava flows (6). However, this interpretation does 
not necessarily extend to the regional scale. A 
single large shallow intrusion seems unlikely 
to explain a regional unit over such a wide 
area (>70,000 km²) with diverse local topographical
influences. Similarly, the cumulate
rock does not support the hypothesis that 
the regional unit is formed by a single deposit 
of clastic material (volcanic ash, impact ash, 
or sediments). We suggest that the regional 
exposure represents a series of related intru- 
sive (sills and laccoliths) and extrusive magma 
bodies (lava flows and pyroclastic deposits). 
It could also contain some reworked sedi- 
mentary derivatives of the igneous rocks. 
Abundances of incompatible elements [those 
that preferentially partition into the melt, such as 
Na, K, rubidium (Rb), strontium (Sr), lantha- 
num (La), samarium (Sm), neodymium (Nd), 
ytterbium (Yb), uranium (U), and thorium (Th)] 
in igneous rocks constrain the mantle source 
compositions. Ratios of rare earth elements 
(for example, La/Yb) are often used. However, 
these elements are of low abundances (typi- 
cally at parts per million or parts per billion 
by weight) that are below PIXL’s detection 
limit. The abundance ratio of K₂O/TiO₂ can
be used instead to constrain source charac- 
teristics, provided that the magma has not 
experienced crystallization and removal of 
K- or Ti-bearing minerals (29). We compared 
our results with meteorites and igneous rocks 
studied by other rovers. The weight K₂O/TiO₂
ratio (0.38 ± 0.48) of the unaltered portion
of Dourbes falls in the range found in mar- 
tian meteorites containing olivine cumulates 
(chassignite) or augite cumulates (nakhlite), 
as well as mafic rocks investigated by other 
Mars rovers and landers (Fig. 4B). All of these 
samples are enriched in highly incompatible 
elements (such as K, U, and Th) compared 
with Ti (30). Measurements of Sm-Nd and 
Rb-Sr isotopes in nakhlites and chassignites
have indicated that their mantle sources were
depleted in highly incompatible elements; this
suggests enrichment of highly incompatible elements
in these cumulate meteorites during melt
separation from the mantle sources (26, 28).
A process that enriches highly incompatible
elements, possibly driven by interaction with
high-temperature aqueous fluids after rock formation
(metasomatic processes), has previously
been suggested for the mantle source
regions of meteorites (31) and alkali-rich basalts
in Gusev crater (32) and Gale crater (33, 34).
Alternatively, geophysical data and models 
have been interpreted as indicating that Mars’ 
mantle is depleted overall in radioactive heat- 
producing (K, U, and Th) and other highly 
incompatible elements, implying that those ele- 
ments are instead concentrated in the planet’s 
crust or upper mantle (35). Elevated abun- 
dances of highly incompatible elements in the 
mantle source regions at Gusev, Gale, and 
Jezero craters, as well as those for chassig- 
nites and nakhlites, could indicate a general 
enrichment of such elements in Mars’ upper 
mantle, possibly caused by metasomatic processes
and/or plume activity. These different
samples range in formation age from
>3.9 billion to 1.34 billion years (3, 36). This
wide range of ages and geographic locations
of igneous rocks with these properties indicates
a long-lived process that has been operating
on Mars for billions of years and has affected
a large fraction of Mars’ crust and upper
mantle (16).


Oncometabolite D-2HG alters T cell metabolism to
impair CD8⁺ T cell function


Giulia Notarangelo, Jessica B. Spinelli, Elizabeth M. Perez, Gregory J. Baker, Kiran Kurmi,
Ilaria Elia, Sylwia A. Stopka, Gerard Baquer, Jia-Ren Lin, Alexandra J. Golby, Shakchhi Joshi,
Heide F. Baron, Jefte M. Drijvers, Peter Georgiev, Alison E. Ringel, Elma Zaganjor,
Samuel K. McBrayer, Peter K. Sorger, Arlene H. Sharpe, Kai W. Wucherpfennig,
Sandro Santagata, Nathalie Y. R. Agar, Mario L. Suva, Marcia C. Haigis*


Gain-of-function mutations in isocitrate dehydrogenase (IDH) in human cancers result in the production
of D-2-hydroxyglutarate (D-2HG), an oncometabolite that promotes tumorigenesis through epigenetic
alterations. The cancer cell-intrinsic effects of D-2HG are well understood, but its tumor cell-nonautonomous
roles remain poorly explored. We compared the oncometabolite D-2HG with its enantiomer, L-2HG, and
found that tumor-derived D-2HG was taken up by CD8⁺ T cells and altered their metabolism and antitumor
functions in an acute and reversible fashion. We identified the glycolytic enzyme lactate dehydrogenase
(LDH) as a molecular target of D-2HG. D-2HG and inhibition of LDH drive a metabolic program and immune
CD8⁺ T cell signature marked by decreased cytotoxicity and impaired interferon-γ signaling that was
recapitulated in clinical samples from human patients with IDH1 mutant gliomas.


The discovery of mutations in genes encoding
key metabolic enzymes has highlighted
a direct link between altered
metabolism and disease. In particular,
the discovery of mutations in fumarate
hydratase, succinate dehydrogenase, and isocitrate
dehydrogenase (IDH), all enzymes of
the tricarboxylic acid cycle, has uncovered
how mitochondrial metabolites can accumulate
and act as disease-driving signaling
molecules, becoming bona fide oncometabolites
(1). Beyond their established intracellular
functions, oncometabolites have tumor
cell-nonautonomous roles and mediate intercellular
cross talk in the tumor microenvironment
(TME).
The oncometabolite D-2-hydroxyglutarate
(D-2HG) is produced by cancers with gain-of-function
mutations in IDH1 and IDH2 and
accumulates to levels up to ~30 mM in the
surrounding TME (2-8). D-2HG is an inhibitor
of many alpha-ketoglutarate (αKG)-dependent
dioxygenases, leading to slowly accumulating
alterations in the epigenetic landscape of cells
that result in oncogenic transformation (9-12).
Despite our understanding of the cancer cell-intrinsic
regulation of epigenetics by D-2HG, the
tumor cell-nonautonomous effects of D-2HG
in the TME remain poorly understood. IDH-mutant
gliomas have fewer tumor-infiltrating
lymphocytes, including T cells, compared with
their wild-type (WT) counterparts, suggesting
that D-2HG promotes the establishment
of an immunosuppressive TME (1, 13-16). However,
the mechanisms by which D-2HG alters
antitumor immunity remain unknown.


We focused on CD8+ T cells, which are a major component of the adaptive immune response and
specialize in the direct killing of malignant cells, to investigate the impact of
tumor-derived D-2HG on antitumor immunity.


Results 

Acute and reversible inhibition of CD8+ T cell
proliferation, cytotoxicity, and interferon-γ
signaling by D-2HG

To evaluate the tumor cell-nonautonomous mechanisms by which D-2HG influences antitumor
immunity, we examined the effects of the oncometabolite on the function of murine CD8+
T cells. We compared D-2HG with its enantiomer, L-2HG, a metabolic by-product of enzyme
promiscuity that supports T cell survival and function (17, 18). To determine whether
pathophysiological concentrations of D-2HG in the TME of IDH-mutant tumors (2-8) permit
cellular uptake by CD8+ T cells, we treated activated CD8+ T cells with various doses of
non-cell-permeable D-2HG and L-2HG and monitored uptake by liquid chromatography-mass
spectrometry. Both enantiomers were taken up rapidly and in similar quantities when
supplied in millimolar concentrations (Fig. 1A and fig. S1A). Next, we quantified the
intracellular levels of each enantiomer. When 20 mM D-2HG or L-2HG was added, we estimated
that the intracellular concentration of 2HG reached low millimolar levels (Fig. 1B). At
this concentration, neither enantiomer had a detrimental effect on cell viability over a
6-day time course (fig. S1B).

Upon entering the TME of IDH-mutant tumors, infiltrating CD8+ T cells undergo local
expansion and reactivation in the presence of high amounts of D-2HG. Therefore, we
evaluated the effects of D-2HG or L-2HG on the activation and proliferation of CD8+
T cells. After stimulation with anti-CD3/CD28 antibodies, CD8+ T cells fully activated in
the presence of either enantiomer, as assessed by cell surface expression of the activating
markers CD69 and CD25 (fig. S1C). Although both enantiomers achieved similar intracellular
concentrations, D-2HG was a more potent inhibitor of T cell proliferation than L-2HG
(Fig. 1C). The inhibitory effects of D-2HG on proliferation were dose dependent and
observed only at concentrations sufficient for cellular uptake (fig. S1D). Removing D-2HG
from the culture medium alleviated this proliferative block (Fig. 1D and fig. S1E),
demonstrating that the effects of D-2HG on proliferation are reversible.
CD8+ T cells can eliminate target cells through exocytosis of preformed lysosomes
containing cytotoxic proteins such as granzyme B (Fig. 1E) (19). After T cell activation,
intracellular granzyme B levels were decreased in the presence of D-2HG (Fig. 1F). During
the process of degranulation, the membrane of granzyme-containing lysosomes fuses with the
plasma membrane of activated CD8+ T lymphocytes to allow for cargo release (20). To
evaluate degranulation, we assessed the cycling of lysosomal-associated membrane protein 1
and 2 (LAMP-1 and LAMP-2, also known as CD107a and CD107b) to the cell surface in response
to T cell receptor signaling (21). CD8+ T cells activated in the presence of D-2HG failed
to degranulate upon antigen restimulation in a dose-dependent manner (fig. S1, F, G, and I,
and Fig. 1G). The total amounts of intracellular LAMP-1 were not affected by D-2HG
treatment (fig. S1H). Given these observations, we then investigated whether the loss of
LAMP-1/2 surface expression could be attributed to an impairment in granule formation at
the time of ex vivo activation (day 0) or a defect in degranulation at the time of
restimulation (day 6). To test this, CD8+ T cells were activated in the presence of D-2HG
for 6 days, and then D-2HG was removed at the time of restimulation. Washing away D-2HG at
the time of restimulation partially restored the ability of CD8+ T cells to degranulate
(fig. S1, F and I, and Fig. 1G, washout condition). Conversely, CD8+ T cells that were
activated for 6 days and subsequently exposed to D-2HG only at the time of restimulation
failed to degranulate (fig. S1, F and I, and Fig. 1G, acute condition). These data indicate
that D-2HG leads to acute defects in CD8+ T cell degranulation by affecting processes
involved in the formation of lytic effectors and the release of lytic contents.

Stained, naive cells (shaded gray) that did not proliferate are shown. Right: Division
index corresponds to the average number of cell divisions that a cell in the total cell
population has undergone (n = 3). (D) CellTrace Violet dilution assay according to
experimental design described in fig. S1E to assess reversibility of proliferation
phenotype. (E) Schematic of a cytotoxic CD8+ T cell releasing granzyme B and IFN-γ.
(F) Intracellular granzyme B levels of CD8+ T cells activated by CD3/CD28 mAbs in the
presence of 20 mM D-2HG, 20 mM L-2HG, or left untreated for 3 days. (G) Percentage
degranulation as assessed by CD107a/b staining of CD8+ T cells activated by CD3/CD28 mAbs
and restimulated by CD3 mAb in the presence of 20 mM D-2HG, 20 mM L-2HG, or left
untreated. In the washout condition, CD8+ T cells were activated in the presence of 20 mM
D-2HG, and the oncometabolite was subsequently washed out before restimulation. In the
acute condition, D-2HG was added solely at the time of restimulation. In all other
conditions, the metabolites were kept for the entirety of the assay (n = 3).
(H) Percentage of IFN-γ+ CD8+ T cells after intracellular cytokine staining of CD8+
T cells activated with phorbol myristate acetate (PMA) and ionomycin for 4 hours in the
presence of 20 mM D-2HG, 20 mM L-2HG, or left untreated (n = 3). (I) Mean fluorescent
intensity for IFN-γ after intracellular cytokine staining of CD8+ T cells activated with
PMA and ionomycin for 4 hours in the presence of 20 mM D-2HG, 20 mM L-2HG, or left
untreated (n = 3). (J) IFN-γ levels in the medium of CD8+ T cells activated with 20 mM
D-2HG, 20 mM L-2HG, or left untreated for 24 hours (n = 3). (K) Volcano plot showing
distribution of the top down-regulated and up-regulated genes in CD8+ T cells activated in
the presence of 20 mM D-2HG or left untreated. Statistically significant ISGs with a fold
change >1.5 are marked in pink (n = 3). (L) Antigen-specific killing of B16
ovalbumin-positive tumor cells by OT1 CD8+ T cells that were activated in the presence of
20 mM D-2HG, 20 mM L-2HG, or left untreated. In the washout condition, D-2HG was removed
before co-culture seeding of tumor cells and T cells. In the acute condition, D-2HG was
added at the time of tumor cell and T cell co-culture. In all other conditions, the
metabolites were kept for the entirety of the assay (n = 3). *P < 0.05, **P < 0.01,
***P < 0.001, ****P < 0.0001 (one-way and two-way ANOVA). Data are representative of at
least two independent experiments.
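The Fig. 1 legend describes the division index as the average number of divisions undergone by a cell of the starting population. The article does not state how it was computed, so the following is only a minimal sketch of one common convention for dye-dilution data, in which the cells counted in generation i are traced back to 1/2^i precursor cells each. The function name and the example counts are hypothetical.

```python
def division_index(cells_per_generation):
    """Average number of divisions per cell of the starting population.

    cells_per_generation[i] is the number of cells counted in the dye-dilution
    peak that has divided i times (index 0 is the undivided peak).
    """
    # Each cell in generation i descends from 1 / 2**i of an original cell.
    precursors = [n / 2 ** i for i, n in enumerate(cells_per_generation)]
    total_precursors = sum(precursors)
    if total_precursors == 0:
        raise ValueError("no cells counted")
    # Divisions undergone, weighted by the number of precursors per generation.
    divisions = sum(i * p for i, p in enumerate(precursors))
    return divisions / total_precursors


# Hypothetical counts: 100 undivided cells, 200 after one division, 400 after two.
example = division_index([100, 200, 400])
```

With these counts every generation traces back to 100 precursors, so the index is 1.0; a culture that did not divide at all gives 0.0.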


Together with granzyme B, CD8+ T cells also secrete interferon-γ (IFN-γ), a cytokine that
sensitizes tumor cells to killing by T cells (Fig. 1E) (22). After stimulation,
D-2HG-treated CD8+ T cells exhibited a dose-dependent defect in IFN-γ production, as
determined by decreased IFN-γ intracellular staining and reduced release of the cytokine
into the medium (Fig. 1, H to J, and fig. S1J). IFN-γ controls specific gene expression
programs of hundreds of genes (23). Therefore, we performed gene expression profiling by
RNA sequencing of CD8+ T cells activated in the presence of D-2HG or L-2HG or left
untreated. Gene Ontology (GO) analysis showed that most of the down-regulated genes upon
D-2HG treatment belonged to the class of IFN-γ-stimulated genes (ISGs) (Fig. 1K and
fig. S1, K and L). Thus, CD8+ T cells exhibit a decrease in the IFN-γ-induced signature,
reflecting a decrease in IFN-γ production and secretion after D-2HG treatment.
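The Fig. 1K legend marks statistically significant ISGs with a fold change above 1.5. As an illustration of how such a call can be made per gene, here is a small sketch. The pseudocount, the 0.05 significance threshold, the gene names, and all numbers are assumptions for illustration and are not taken from the article.

```python
import math


def log2_fold_change(treated, control, pseudocount=1.0):
    """log2 ratio of treated to control expression; the pseudocount avoids log(0)."""
    return math.log2((treated + pseudocount) / (control + pseudocount))


def classify_gene(treated, control, p_value, fold_cutoff=1.5, alpha=0.05):
    """Return 'down', 'up', or 'ns' (not significant) for one gene."""
    lfc = log2_fold_change(treated, control)
    # A gene is called only if it is both significant and beyond the fold cutoff.
    if p_value >= alpha or abs(lfc) <= math.log2(fold_cutoff):
        return "ns"
    return "up" if lfc > 0 else "down"


# Hypothetical (treated, control, p-value) triples, for illustration only.
genes = {
    "gene_a": (99.0, 399.0, 0.001),
    "gene_b": (210.0, 200.0, 0.40),
}
calls = {name: classify_gene(*values) for name, values in genes.items()}
```

Working on the log2 scale makes a 1.5-fold decrease and a 1.5-fold increase symmetric around zero, which is why a volcano plot uses it on the horizontal axis.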

To assess whether the decreased production of cytotoxic proteins and effector cytokines had
functional consequences, we probed the antigen-specific antitumor killing capacities of
D-2HG- or L-2HG-treated cells in an in vitro killing assay. D-2HG-treated CD8+ T cells
exhibited poor killing of cancer cells both when D-2HG was provided starting at the time of
their activation and when it was administered to activated CD8+ T cells only at the time of
co-culture with cancer cells (Fig. 1L and fig. S1, M to O). Thus, the effects of D-2HG on
cytotoxicity are rapid and independent of upstream events involving T cell activation.
Removal of D-2HG at the time of co-culture from CD8+ T cells that had been activated in the
presence of D-2HG restored their antitumor killing capacities (Fig. 1L and fig. S1M,
washout condition), indicating that D-2HG must be present at the time of tumor cell
recognition to impair their killing. Thus, we conclude that D-2HG acutely impairs the
antitumor killing properties of CD8+ T cells.


D-2HG alters glycolysis in CD8+ T cells


Because of their acute and reversible nature, the effects of D-2HG on CD8+ T cell
proliferation, cytokine production, and cytotoxicity appear to be independent of its
established role as an epigenetic regulator. In support of this, acute treatment with D-2HG
did not result in epigenetic reprogramming of T cells (fig. S2A). We therefore investigated
whether D-2HG had an effect on metabolism. First, we tested whether either enantiomer could
be metabolized. The chiral-specific reduction of 2HG to αKG is mediated by D-2HG and L-2HG
dehydrogenase (D-2HGDH and L-2HGDH, respectively) (24). In agreement with the literature
(25), Western blot analysis demonstrated undetectable amounts of D-2HGDH and L-2HGDH in
CD8+ T cells (fig. S2B). Furthermore, stable isotope tracing of ¹³C₅ D-2HG or L-2HG did not
show evidence of L-2HG and D-2HG metabolism over a 48-hour time course, despite achieving
steady-state isotopic enrichment (fig. S2, C to E). We conclude that D-2HG and L-2HG are
not readily metabolized in CD8+ T cells.

Given that D-2HG and L-2HG led to distinct functional effects on CD8+ T cells, we performed
steady-state metabolomics of intracellular metabolites and metabolites from cell culture
medium after acute 2HG treatment with the goal of identifying metabolic pathways that were
differentially altered by the two enantiomers (Fig. 2A). The metabolic pathway most altered
upon D-2HG treatment was glycolysis. D-2HG, but not L-2HG, promoted the accumulation of
many glycolytic intermediates (Fig. 2B and fig. S2F). Consistent with altered glucose
utilization, we also observed increased pyruvate and decreased lactate in the medium of
D-2HG-treated CD8+ T cells (Fig. 2C and fig. S2G). L-2HG stabilizes hypoxia-inducible
factor-1α (HIF-1α), which is a master transcriptional regulator of glycolysis (17, 26, 27).
Therefore, we assessed whether the reported changes in glucose catabolism could be
attributed to D-2HG-mediated HIF-1α regulation. We found that D-2HG, unlike its enantiomer
L-2HG (28), did not promote HIF-1α stability in T cells (fig. S2H). This result was further
confirmed by quantitative polymerase chain reaction analysis, showing no major changes in
the expression of genes involved in glucose utilization upon D-2HG treatment (fig. S2I).
Thus, we conclude that D-2HG alters glucose catabolism through an HIF-1α-independent
mechanism.

Fig. 2. D-2HG alters glycolysis in CD8+ T cells. (A) Schematic of experimental design to
assess the acute (24 hours) effects of 20 mM D-2HG and 20 mM L-2HG on steady-state
metabolite levels in activated CD8+ T cells. (B) Log2 fold changes of glycolytic
intermediates in 20 mM D-2HG-treated versus 20 mM L-2HG-treated CD8+ T cells relative to
control. F6P, fructose 6-phosphate; G6P, glucose 6-phosphate; DHAP, dihydroxyacetone
phosphate; GA3P, glyceraldehyde 3-phosphate; PEP, phosphoenolpyruvic acid (n = 3).
(C) Log2 fold changes of key glycolytic metabolites that are secreted into or consumed from
the medium of D-2HG-treated, L-2HG-treated, or untreated CD8+ T cells (n = 3).
(D) Schematic of expected incorporation of heavy carbons into glycolytic intermediates
after ¹³C₆ glucose is provided for 24 hours. (E) Ion intensities of glycolytic
intermediates and their respective ¹³C-isotopologs after 24-hour cotreatment of ¹³C₆
glucose with 20 mM D-2HG, 20 mM L-2HG, or control (n = 3). (F) Levels of secreted pyruvate
isotopologs in the medium of 20 mM D-2HG-treated, 20 mM L-2HG-treated, and untreated CD8+
T cells after providing ¹³C₆ glucose for 24 hours (n = 3). (G) Levels of secreted lactate
isotopologs in the medium of 20 mM D-2HG-treated, 20 mM L-2HG-treated, and untreated CD8+
T cells after providing ¹³C₆ glucose for 24 hours (n = 3). *P < 0.05, **P < 0.01,
***P < 0.001, ****P < 0.0001 (two-way ANOVA). Data are representative of at least two
independent experiments.

Fig. 3. D-2HG inhibits LDH-A activity in vitro and in CD8+ T cells. (A) Schematic of LDH
reaction. (B) Intracellular lactate (M+3)/pyruvate (M+3) ratio of CD8+ T cells after
24-hour cotreatment of ¹³C₆ glucose with 20 mM D-2HG or control (n = 3). (C) Schematic of
the metabolic consequences of antimycin A (AA) treatment on glucose catabolism.
(D) Intracellular lactate (M+3)/pyruvate (M+3) ratio after brief 20 mM D-2HG pretreatment,
followed by AA and ¹³C₆ glucose cotreatment (n = 3). Solid bars, -AA; striped bars, +AA.
(E) In vitro enzymatic assessment of 3 mM L-2HG, 3 mM D-2HG, 80 μM oxamate,

and 10 nM GSK2837808A on LDH-A activity. Oxamate and GSK2837808A are known LDH inhibitors
and were used as a control (n = 3). (F) Lineweaver-Burk plot for D-2HG. (G) NAD+/NADH ratio
of CD8+ T cells treated for 24 hours with control, 1 mM nicotinamide mononucleotide (NMN),
20 mM D-2HG, or 20 mM L-2HG. NMN raises intracellular NAD+ levels and was therefore used as
a positive control (n = 3). *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 (Student's
t test, one-way and two-way ANOVA). Data are representative of at least two independent
experiments.

To further investigate how glycolytic flux is altered in the presence of D-2HG and L-2HG,
we performed tracing experiments with ¹³C₆ glucose (Fig. 2D). Consistent with our
steady-state metabolomic profiles, we observed increased labeling in glycolytic
intermediates preceding lactate and a decrease in labeling of lactate (Fig. 2E). These
results were confirmed by metabolite analysis in the cell culture medium. We detected
increased pyruvate (M+3) accumulation and decreased lactate (M+3) secretion upon D-2HG
treatment (Fig. 2, F and G). Collectively, these findings demonstrate that D-2HG, and not
L-2HG, strongly inhibits glycolysis in CD8+ T cells.


D-2HG inhibits LDH activity in CD8+ T cells and
lowers the NAD+/NADH ratio


Fig. 4. Cytosolic NAD(H) imbalance drives mitochondrial membrane hyperpolarization in
D-2HG-treated CD8+ T cells. (A) Relative contribution of mitochondrial oxidative
phosphorylation and glycolysis to ATP production in response to D-2HG after 24 hours of
treatment (n = 10). (B) Quantification of basal respiration in response to 20 mM D-2HG
after 24 hours of treatment (n = 10). (C) Mitochondrial membrane potential as assessed by
tetramethylrhodamine ethyl ester (TMRE) fluorescence in CD8+ T cells treated with 20 mM
D-2HG, 20 mM L-2HG, or control for 3 days. (D) TMRE assay according to experimental design
described in fig. S4F. (E) Mitochondrial membrane potential as assessed by TMRE
fluorescence in CD8+ T cells treated with 20 mM D-2HG, 20 mM L-2HG, or control for 2 hours.
(F) Schematic of targets of rotenone, AA, oligomycin, and FCCP. (G) Effects of inhibition
of ETC complexes on mitochondrial membrane potential in control and D-2HG-treated CD8+
T cells (n = 3). (H) Quantification of spare respiratory capacity in response to 20 mM
D-2HG after 24 hours of treatment (n = 10). (I) Schematic of targets of AOA.
(J) Mitochondrial membrane potential as assessed by TMRE fluorescence in CD8+ T cells
treated with 20 mM D-2HG or control for 24 hours in the presence or absence of 1 mM AOA
(n = 3). (K) Schematic of LbNOX mechanism of action. (L) Western blot of CD8+ T cells
overexpressing empty vector (EV) or FLAG-tagged cytosolic LbNOX. (M) NAD+/NADH ratio of
CD8+ T cells overexpressing EV or cytosolic LbNOX after a 24-hour-long treatment with
20 mM D-2HG or control (n = 3). (N) TMRE staining of EV or LbNOX-overexpressing CD8+
T cells treated with 20 mM D-2HG or left untreated for 24 hours (n = 3). (O) Schematic of
the target of rotenone. (P) Quantification of division index of CD8+ T cells activated in
the presence or absence of 20 mM D-2HG for 3 days and cotreated with 1 nM rotenone (n = 3).
(Q) Quantification of intracellular ROS levels as assessed by CellROX staining in CD8+
T cells treated with 20 mM D-2HG or left untreated for 1 day. Hydrogen peroxide (H2O2,
100 mM) and 10 mM NAC were used as positive and negative controls, respectively (n = 3).
*P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 (Student's t test, one-way and two-way
ANOVA). Data are representative of at least two independent experiments.

Pyruvate to lactate interconversion is catalyzed by the enzyme LDH (Fig. 3A) (29). The
dose-dependent decrease in pyruvate to lactate conversion, as measured by the ratio of
lactate (M+3) to pyruvate (M+3) upon ¹³C₆ glucose tracing (Fig. 3B and fig. S3A), led us to
test whether LDH activity is impaired in the presence of D-2HG. We treated CD8+ T cells
with antimycin A, an inhibitor of the electron transport chain (ETC) that limits
mitochondria's ability to generate ATP and increases reliance on glycolysis to support the
bioenergetic needs of a cell (Fig. 3C) (30). In response to antimycin A, activated CD8+
T cells induced pyruvate to lactate conversion, as indicated by an increase in the
lactate/pyruvate ratio. By contrast, this induction was significantly suppressed in
D-2HG-treated CD8+ T cells (Fig. 3D), supporting the model that LDH function is
compromised in the presence of D-2HG.
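The readout used here is the ratio of fully labeled lactate (M+3) to fully labeled pyruvate (M+3) after labeled glucose tracing, and how much that ratio rises when antimycin A pushes cells toward glycolysis. The sketch below shows the arithmetic only. The function names and all ion intensities are hypothetical and chosen for illustration; they are not measurements from the article.

```python
def lactate_to_pyruvate_ratio(lactate_m3, pyruvate_m3):
    """Ratio of M+3 lactate to M+3 pyruvate ion intensity (a proxy for LDH flux)."""
    if pyruvate_m3 <= 0:
        raise ValueError("pyruvate (M+3) signal must be positive")
    return lactate_m3 / pyruvate_m3


def fold_induction(ratio_with_aa, ratio_without_aa):
    """How strongly antimycin A (AA) raises the lactate/pyruvate ratio."""
    return ratio_with_aa / ratio_without_aa


# Hypothetical ion intensities (arbitrary units), for illustration only.
control_minus_aa = lactate_to_pyruvate_ratio(8.0, 2.0)
control_plus_aa = lactate_to_pyruvate_ratio(24.0, 2.0)
treated_minus_aa = lactate_to_pyruvate_ratio(4.0, 4.0)
treated_plus_aa = lactate_to_pyruvate_ratio(6.0, 4.0)

control_induction = fold_induction(control_plus_aa, control_minus_aa)
treated_induction = fold_induction(treated_plus_aa, treated_minus_aa)
```

In this made-up example the control ratio rises threefold with antimycin A while the treated ratio rises only 1.5-fold, which is the pattern one would expect if LDH were inhibited.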

Fig. 5. LDH inhibition recapitulates the effects of D-2HG on CD8+ T cell metabolism,
proliferation, cytotoxicity, and IFN-γ signaling. (A) Lactate (M+3)/pyruvate (M+3) ratio in
CD8+ T cells treated with 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or left untreated
for 24 hours (n = 3). (B) NAD+/NADH ratio of CD8+ T cells after a 24-hour-long treatment
with 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or control (n = 3). (C) Quantification
of basal respiration in response to 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or
control after 24 hours of treatment (n = 10). (D) Left: Mitochondrial membrane potential as
assessed by TMRE fluorescence in CD8+ T cells treated with 20 mM D-2HG, 20 mM oxamate,
10 μM GSK2837808A, or control for 24 hours. Right: quantification of mean fluorescence
intensity for TMRE (n = 3). (E) Left: CellTrace Violet dilution assay at day 3 of CD8+
T cells activated in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or left
untreated. Right: quantification of division index (n = 3). (F) Quantification of
intracellular granzyme B expression in CD8+ T cells activated in the presence of 20 mM
D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or control for 3 days (n = 3). (G) Percentage
degranulation assessed by CD107a/b staining in CD8+ T cells activated in the presence of
20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or left untreated (n = 3). (H) Percentage of
IFN-γ+ CD8+ T cells after intracellular cytokine staining of CD8+ T cells activated with
PMA and ionomycin for 4 hours in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM
GSK2837808A, or left untreated (n = 3). (I) Mean fluorescent intensity for IFN-γ after
intracellular cytokine staining of CD8+ T cells activated with PMA and ionomycin for
4 hours in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or left untreated
(n = 3). (J) Cell number-normalized levels of IFN-γ in the cell culture medium of CD8+
T cells activated in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or
control for 2 days (n = 3). (K) Relative mRNA expression of ISGs in CD8+ T cells activated
in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or control for 24 hours
(n = 3). (L) Specific killing of B16 ovalbumin-positive tumor cells by OT1 CD8+ T cells
activated in the presence of 20 mM D-2HG, 20 mM oxamate, 10 μM GSK2837808A, or control
(n = 3). *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 (one-way ANOVA). Data are
representative of at least two independent experiments.

Fig. 6. Altered metabolic and cytotoxicity signatures in IDH1-mutant relative to IDH1^WT
cancers. (A) Schematic depicting experimental setup. (B) Quantification of 2HG levels by
liquid chromatography-mass spectrometry in the tumor interstitial fluid (TIF) from IDH1^WT
and IDH1^R132H (n = 10 to 12). (C) Quantification of the percentage of CD8+ T cells (of
CD3+CD45+ cells) in spleen and tumor of IDH1^WT and IDH1^R132H MC38 and B16 syngeneic tumor
mouse models (n = 6 to 10). (D) Quantification of the percentage of Ki67+ CD8+ T cells in
spleen and tumor of IDH1^WT and IDH1^R132H MC38 and B16 syngeneic tumor mouse models
(n = 6 to 10). (E) Quantification of the percentage of IFN-γ+, IL-2+, and TNF-α+ CD8+
T cells isolated from IDH1^WT or IDH1^R132H MC38 or B16 tumors on day 16 after inoculation
and after ex vivo restimulation with PMA and ionomycin for 4 hours in the presence of 0 or
2.5 mM D-2HG to recapitulate the 2HG levels measured in the TIF (n = 8). (F) MSI-based
quantification of 2HG levels in tissue sections of human IDH1^R132H gliomas subdivided by
grade. The upper limit of 2HG quantification based on the calibration curve was 30 mM;
therefore, any value beyond that was collapsed to 30 mM. A, astrocytoma; OD,
oligodendroglioma; I-II, low-grade; III-IV, high-grade. (G) Per-section average lactate ion
counts across IDH1^WT GBM (n = 5) and IDH1^R132H (n = 4) astrocytoma, grade IV patients.
(H) Per-pixel lactate and 2HG ion counts across all sections from case #3, an IDH1^WT GBM
patient, and case #19, an IDH1^R132H glioma patient. Sections were annotated based on high
cellularity (HC), low cellularity (LC), nontumor (NT), and transition (T).
(I) Three-dimensional (3D) UMAP embedding of single-cell CyCIF data derived from human
brain tumor sections colored by HDBSCAN cluster identifying a population of CD8+ T cells
(cluster 2, black arrow). (J) 3D UMAP embedding color mapped according to CD8 signal
intensity showing that cluster 2 cells (black arrow) are characterized by high levels of
CD8 expression. (K) Overlay of matched MSI and CyCIF images from case #18, sample 3 of an

To determine whether D-2HG is a direct inhibitor of LDH activity, we used purified
recombinant LDH-A protein, the LDH isozyme expressed by T cells, and measured its activity
in the presence or absence of D-2HG or L-2HG. Using the intracellular 2HG concentration
previously determined (3 mM; Fig. 1B), we observed that D-2HG strongly inhibited LDH-A
activity in vitro (Fig. 3E and fig. S3, B and C). LDH-B was also inhibited by D-2HG,
suggesting that the inhibitory properties of D-2HG are not isozyme specific (fig. S3D).
Although our tracing studies did not provide strong evidence for L-2HG-mediated inhibition
of LDH, our in vitro kinetic assay revealed that L-2HG also inhibits, albeit less
effectively, LDH-A activity (Fig. 3E and fig. S3, E and F). We determined D-2HG and L-2HG
to be noncompetitive inhibitors of LDH-mediated pyruvate reduction, with inhibition
constant (Ki) values of 840 ± 0.045 μM and 3.58 ± 0.206 mM, respectively (Fig. 3F and
fig. S3G). Given that the reported Ki value for D-2HG is below the estimated intracellular
D-2HG concentration, we conclude that D-2HG can inhibit LDH activity in CD8+ T cells.
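To see why a Ki below the intracellular inhibitor concentration matters, consider the textbook rate law for pure noncompetitive inhibition, in which the inhibitor lowers the apparent Vmax by a factor of (1 + [I]/Ki) and leaves Km unchanged. The sketch below applies that rate law; it is not the authors' fitting procedure. The Ki values are read from the text as printed (taking 840 μM as 0.84 mM), and the 3 mM inhibitor concentration is the intracellular estimate quoted above. Function names and the Vmax and Km values in the final check are hypothetical.

```python
def noncompetitive_rate(substrate, inhibitor, vmax, km, ki):
    """Michaelis-Menten rate under pure noncompetitive inhibition.

    All concentrations must share one unit (mM here). The inhibitor scales
    Vmax down by (1 + inhibitor / ki); Km is unchanged.
    """
    apparent_vmax = vmax / (1.0 + inhibitor / ki)
    return apparent_vmax * substrate / (km + substrate)


def remaining_activity(inhibitor, ki):
    """Fraction of uninhibited activity left, independent of substrate level."""
    return 1.0 / (1.0 + inhibitor / ki)


# Ki values as printed in the text (mM); 3 mM is the estimated intracellular 2HG.
activity_with_d2hg = remaining_activity(3.0, 0.84)
activity_with_l2hg = remaining_activity(3.0, 3.58)

# Hypothetical Vmax and Km: an inhibitor present at its Ki halves the rate.
rate_free = noncompetitive_rate(1.0, 0.0, vmax=10.0, km=1.0, ki=1.0)
rate_at_ki = noncompetitive_rate(1.0, 1.0, vmax=10.0, km=1.0, ki=1.0)
```

Under these assumptions roughly a fifth of LDH activity would remain at 3 mM D-2HG and roughly half at 3 mM L-2HG. On a Lineweaver-Burk plot this behavior appears as lines with a shared x-intercept (-1/Km) and increasing y-intercepts as the inhibitor concentration rises.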

A critical function of the LDH reaction is the regeneration of cytosolic NAD+ to support
the continued functioning of glycolysis (31). D-2HG treatment lowered the NAD+/NADH ratio
of the cell (Fig. 3G). Thus, our data indicate that D-2HG inhibits LDH activity in CD8+
T cells, resulting in altered glycolytic flux and NAD(H) balance.


D-2HG treatment induces CD8+ T cells to
maximize mitochondrial respiration


Upon activation, T cells undergo a switch from oxidative phosphorylation to glycolysis to
support their proliferative and effector functions (32). We sought to investigate how D-2HG
affects this metabolic switch. We tested the source (mitochondrial or glycolytic) of
cellular ATP in D-2HG-treated CD8+ T cells using the Seahorse XF Real-Time ATP rate assay.
Consistent with our model of D-2HG-mediated LDH inhibition, D-2HG-treated CD8+ T cells
relied less on glycolysis and more on oxidative phosphorylation for ATP production than
untreated cells (Fig. 4A). Furthermore, we observed an increased oxygen consumption rate
and decreased extracellular acidification rate upon D-2HG treatment (Fig. 4B and fig. S4,
A to C).

To study how D-2HG increases reliance on mitochondrial function, we measured changes in the
mitochondrial membrane potential (MMP). D-2HG caused hyperpolarization of the MMP in a
dose-dependent manner (Fig. 4C and fig. S4D). Kinetics studies revealed that the membrane
of D-2HG-treated mitochondria remained hyperpolarized over several days after T cell
activation (fig. S4E). To determine whether the increase in MMP by D-2HG was reversible, we
provided D-2HG to CD8+ T cells for 2 days and then removed it (fig. S4F). Removing D-2HG
for as little as 1 hour was sufficient to completely restore the MMP (Fig. 4D). Conversely,
a 2-hour exposure to D-2HG was sufficient to promote mitochondrial membrane
hyperpolarization (Fig. 4E). The increase in MMP did not result from an increase in
mitochondrial number, because D-2HG treatment did not alter the mitochondrial to nuclear
DNA ratio or the expression of subunits of the ETC (fig. S4, G and H). Thus, D-2HG
hyperpolarizes the mitochondrial membrane and increases mitochondrial respiration in an
acute and reversible manner.

IDH1^R132H astrocytoma. Spatial positions of cluster 2 cells are shown in green; Hoechst
nuclear dye (DNA) is shown in gray. Grid superimposed on the image shows equal-sized
microregions of tissue with x and y intervals corresponding to the pixel size of the
original MSI image used to compute statistics in (L) and (M). (L) Welch's t tests comparing
mean tissue density-corrected D-2HG levels in microregions of tissue with and without CD8+
T cells in case #18, sample 3. (M) Welch's t tests comparing mean tissue density-corrected
D-2HG levels in microregions of tissue with and without CD8+ T cells in case #19, sample 1.
(N) Spearman's rank-order correlation showing anticorrelation between mean tissue
density-corrected D-2HG levels and CD8+ T cell counts in microregions of tissue in case
#18, sample 3. (O) Spearman's rank-order correlation showing anticorrelation between mean
tissue density-corrected D-2HG levels and CD8+ T cell counts in microregions of tissue in
case #19, sample 1. (P) Heatmap showing expression of top DE up-regulated genes in CD8+ and
CD4+ T cells in IDH^WT GBM relative to IDH-mutant gliomas. Gene expression is zero-centered
and given in units of ln(TP100K+1). (Q) Grouping of the top up-regulated genes into
hallmark gene sets. (R) Density plot of expression of top DE up-regulated genes in the IFN
subcluster for CD8+ T cells in IDH^WT and IDH-mutant samples. (S) Density plot of
expression of the top DE up-regulated genes in the chemokine-IFN-γ subcluster for CD8+
T cells in IDH^WT and IDH-mutant samples. (T) Density plot of expression of the top DE
up-regulated genes in cytotoxicity-NK subcluster for CD8+ T cells in IDH^WT and IDH-mutant
samples. (U) Overlay of the top DE ISGs from mouse RNA sequencing onto the top 400 DE
up-regulated genes in the CD4+ and CD8+ T cell populations of IDH-mutant and IDH^WT tumors
showing the relative expression of genes in common between the two datasets. *P < 0.05,
**P < 0.01, ***P < 0.001, ****P < 0.0001 (Student's t test and two-way ANOVA). Data are
representative of at least two independent experiments.

The MMP is maintained by the activity of the proton-pumping complexes (I, III, and IV) of
the ETC (33). To determine whether the increase in MMP upon D-2HG treatment was due to
increased complex activity, we used a series of inhibitors against various ETC complexes
(Fig. 4F). In both untreated and D-2HG-treated T cells, inhibition of complexes I and III
by rotenone and antimycin A, respectively, lowered the MMP. Similarly, treatment with
carbonyl cyanide p-trifluoromethoxyphenylhydrazone (FCCP), an uncoupling agent that
collapses the proton gradient, decreased the MMP. Conversely, inhibition of complex V by
oligomycin, which prevents proton reentry into the matrix, further increased the MMP
(Fig. 4G). Despite respiring at a higher rate, the mitochondria of D-2HG-treated CD8+
T cells exhibited decreased maximal respiration and thus a decreased spare respiratory
capacity, a measure of the cells' ability to respond to increased energy demand or stress
(Fig. 4H and fig. S4, I and J). These data indicate that the mitochondria of CD8+ T cells
maximize mitochondrial respiration in response to D-2HG.
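Spare respiratory capacity is conventionally the gap between maximal and basal mitochondrial respiration, so a cell that raises its basal rate while its maximal rate falls loses reserve from both sides. The sketch below uses the usual mitochondrial stress test definitions (basal rate before injections, maximal rate after FCCP, non-mitochondrial rate after rotenone plus antimycin A). The article does not spell out its calculation, so these definitions, the function name, and the oxygen consumption values are assumptions for illustration.

```python
def respiration_parameters(ocr_baseline, ocr_after_fccp, ocr_after_rot_aa):
    """Derive respiration parameters from oxygen consumption rates (OCR).

    ocr_baseline:     OCR before any injection
    ocr_after_fccp:   OCR after uncoupling with FCCP (maximal rate)
    ocr_after_rot_aa: OCR after rotenone + antimycin A (non-mitochondrial)
    """
    basal = ocr_baseline - ocr_after_rot_aa
    maximal = ocr_after_fccp - ocr_after_rot_aa
    return {"basal": basal, "maximal": maximal, "spare": maximal - basal}


# Hypothetical OCR values (pmol O2/min), for illustration only.
untreated = respiration_parameters(100.0, 180.0, 20.0)
treated = respiration_parameters(130.0, 150.0, 20.0)
```

In this made-up example the treated cells respire faster at baseline (110 versus 80) yet have a smaller spare capacity (20 versus 80), matching the qualitative pattern described above.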


Cytosolic NAD(H) imbalance supports
mitochondrial function in D-2HG-treated
CD8+ T cells


LDH inhibition results in cytosolic accumulation of NADH, which can enter mitochondria and
provide reducing equivalents for the ETC (34). We tested whether cytosolic NADH might be
diverted to the mitochondria for oxidation in response to D-2HG. Because mitochondria are
impermeable to NADH, cytosolic NADH transfer is facilitated by a variety of shuttles, one
being the malate-aspartate shuttle (35). To prevent shuttling of cytosolic NADH to the
mitochondria, we used aminooxyacetate (AOA), which blocks cytosolic and mitochondrial
aspartate aminotransferase reactions (Fig. 4I), and measured the state of the MMP. Although
D-2HG treatment promoted mitochondrial membrane hyperpolarization, simultaneous treatment
with AOA prevented this effect (Fig. 4J).

To test the contribution of cytosolic NAD(H) imbalance to driving the increase in MMP, we overexpressed a cytosol-localized NADH oxidase from Lactobacillus brevis, LbNOX, which generates NAD+ from NADH independently of LDH activity (Fig. 4, K and L) (36). In the presence of LbNOX, the NAD+/NADH ratio of D-2HG-treated CD8+ T cells was increased (Fig. 4M). Cytosolic LbNOX overexpression lowered the MMP in the presence of D-2HG (Fig. 4N and fig. S4, K and L), indicating that cytosolic NAD+/NADH imbalance partially drives the MMP of the cell.

Because NADH is oxidized by complex I of the ETC, we tested whether interventions that limit NAD+ recycling in the mitochondria might exacerbate the effect of D-2HG treatment. We exposed D-2HG-treated CD8+ T cells to rotenone, a complex I inhibitor, and monitored the cells’ ability to proliferate (Fig. 4O). Although rotenone alone had a small effect on proliferation, the combination of rotenone and D-2HG impaired cell proliferation more strongly than D-2HG alone (Fig. 4P). Thus, NADH oxidation by complex I of the ETC appears to be important to sustain the proliferative potential of CD8+ T cells in the presence of D-2HG. Because complex I is the main source of reactive oxygen species (ROS) in the mitochondria (33, 37), we measured ROS and oxidative stress levels in response to D-2HG. D-2HG significantly increased intracellular ROS levels, which were decreased in cells exposed to the antioxidant N-acetyl-cysteine (Fig. 4Q and fig. S4M). Accordingly, D-2HG treatment was accompanied by a significant increase in total glutathione (GSSG) levels, resulting in a lower GSH(reduced)/GSSG(oxidized) ratio, which is indicative of cellular redox stress (fig. S4, N to P). Overall, these data support the idea that D-2HG increases dependency on mitochondrial complex I activity, resulting in higher ROS production.


LDH inhibition is sufficient to recapitulate the effects of D-2HG on CD8+ T cell metabolism and proliferation and IFN-γ signaling and cytotoxicity


To assess the contribution of LDH inhibition to the metabolic phenotypes caused by D-2HG, we tested whether LDH inhibition alone was sufficient to phenocopy treatment of cells with D-2HG using two well-established LDH inhibitors, oxamate and GSK-2837808A (abbreviated as GSK). 13C6-glucose tracing in the presence of either LDH inhibitor recapitulated the effects of D-2HG on glucose catabolism. The LDH inhibitors each promoted the accumulation of glycolytic intermediates up to pyruvate, with concomitant loss of lactate labeling (fig. S5, A and B), resulting in a lower lactate (M+3)/pyruvate (M+3) ratio (Fig. 5A). Inhibition of LDH activity by the inhibitors was further validated by metabolite analysis in cell culture medium, in which we observed increased pyruvate and decreased lactate secretion relative to that from untreated cells (fig. S5, C and D). Consistent with LDH inhibition, the NAD+/NADH ratio of the cell was also decreased in response to either of the inhibitors (Fig. 5B). Thus, oxamate and GSK recapitulate the effects of D-2HG on glucose catabolism.

Impairment of LDH activity by D-2HG caused CD8+ T cells to undergo a metabolic transition toward oxidative phosphorylation. Therefore, we tested whether LDH inhibition was sufficient to cause this effect. LDH inhibition by oxamate or GSK recapitulated the increase in oxygen consumption rate, the decrease in extracellular acidification rate, and the hyperpolarization of the mitochondrial membrane observed in cells treated with D-2HG (Fig. 5, C and D, and fig. S5, E to G). No change in the mitochondrial/nuclear DNA ratio of the cell was detected after LDH inhibition (fig. S5H). Therefore, LDH inhibition is sufficient to cause increased reliance on mitochondrial respiration and hyperpolarization of the mitochondrial membrane in CD8+ T cells.

To test whether LDH inhibition is sufficient to cause the immune phenotypes promoted by D-2HG accumulation, we tested the effects of LDH inhibition alone on viability, antigen-driven proliferation, IFN-γ production, and cytotoxicity. Each of the LDH inhibitors impaired T cell proliferation despite having no effect on cell viability (Fig. 5E and fig. S5I). A decrease in cytotoxicity like that mediated by D-2HG was also observed in response to each of the LDH inhibitors. Upon LDH inhibition, CD8+ T cells exhibited lower expression of granzyme B and failed to degranulate upon antigen restimulation (Fig. 5, F and G, and fig. S5J). IFN-γ production was also decreased in response to LDH inhibition, resulting in a lower percentage of IFN-γ+ CD8+ T cells, decreased IFN-γ accumulation in the media, and a subsequent decrease in the induction of ISGs (Fig. 5, H to K, and fig. S5K). Ultimately, these cytotoxic defects culminated in impaired killing of tumor cells upon LDH inhibition (Fig. 5L and fig. S5L). Thus, LDH inhibition alone phenocopies the effects of D-2HG on CD8+ T cell proliferation, activation, IFN-γ signaling, and cytotoxicity.


IDH-mutant glioma-infiltrating CD8+ T cells exhibit altered metabolic and cytotoxic programs in patient samples


To determine whether D-2HG impairs antitumor immunity, we used two different syngeneic mouse tumor models. We injected B16 melanoma and MC38 colorectal adenocarcinoma cells overexpressing IDH1-WT or IDH1-R132H into the flanks of C57BL/6J mice (Fig. 6A). On day 16, when the tumors were comparable in weight, we isolated tumor interstitial fluid and detected low-millimolar levels of D-2HG in tumors harboring the IDH1 mutation (Fig. 6B and fig. S6A). Flow cytometry profiling of cells in the immune infiltrate of these tumors revealed changes in lymphocytes (fig. S6, B and C), including reduced infiltration, proliferation, and cytokine levels of CD8+ T cells in IDH1-R132H tumors in both models (Fig. 6, C to E). These results suggest that tumor-derived D-2HG impairs T cell responses in the TME of IDH1-R132H tumors.

Mouse models of IDH do not fully recapitulate the developmental and morphological features of human IDH-mutant cancers, arguing for characterization of human specimens. Moreover, the reversible and acute nature of the metabolic phenotypes caused by D-2HG makes it important to study these properties in an intact TME. Thus, we leveraged mass spectrometry imaging (MSI) to study the tumor metabolic landscape of five patients with IDH-WT and 17 patients with IDH-mutant gliomas (table S1). Quantification of 2HG levels in tissue sections revealed a range of high-millimolar concentrations of 2HG across all IDH-mutant gliomas regardless of grade (Fig. 6F and fig. S6D). The 2HG signal was restricted to the tumor area and was absent in surrounding healthy tissue, indicating that 2HG levels accumulate at the tumor site (fig. S6E). Comparison of IDH1-R132H grade IV astrocytomas with IDH1-WT glioblastoma (GBM) revealed trending lower lactate levels in IDH1-R132H relative to IDH1-WT gliomas (Fig. 6, G and H, and fig. S6, F to H).

Because of the immunosuppressive properties of D-2HG, we hypothesized that tumor regions characterized by high levels of the oncometabolite would contain fewer CD8+ T cells compared with those with lower expression. To test this hypothesis, we coupled cyclic immunofluorescence (CyCIF) of IDH1-R132H grade IV astrocytomas with MSI rasters of D-2HG collected on the same tissue sections to study the location of CD8+ T cells with respect to D-2HG expression. Dimensionality reduction [uniform manifold approximation and projection (UMAP)] and clustering [hierarchical density-based clustering (HDBSCAN)] algorithms applied to CyCIF-derived single-cell data identified a population of CD8+ T cells that coexpressed CD8a and CD45RB (Fig. 6, I and J, and fig. S6I). Mapping the coordinates of these cells onto D-2HG MSI rasters, we observed a tendency for these cells to be less present in regions of tissue high in the oncometabolite (Fig. 6K). To quantify this effect, tissues were divided into microregions the size of pixels in the original MSI rasters before computing mean tissue density-corrected D-2HG expression per region. Independent, two-way Welch’s t tests were then used to compare microregions with and without at least one CD8+ T cell (Fig. 6, L and M). Across multiple tissue samples from different patients, this analysis revealed that D-2HG expression was significantly lower in regions of tissue containing CD8+ T cells compared with those that did not. Correlation analysis comparing D-2HG levels and CD8+ T cell numbers using Spearman’s rank-order method established a significant anticorrelation between these variables that was not explained by concurrent anticorrelation between CD8+ T cells and tissue density (Fig. 6, N and O, and fig. S6, K, L, N, and O), suggesting that the avoidance of regions of tissue high in D-2HG by CD8+ T cells was not due to their failure to infiltrate tumor tissue. Together, these studies reveal a dose-dependent relationship between D-2HG and CD8+ T cell density in human gliomas and support the hypothesis that D-2HG may regulate CD8+ T cell proliferation in this context.
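
The two statistics named above can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions, not the authors' analysis pipeline: the input lists are made-up toy values, and the helper names `welch_t`, `average_ranks`, and `spearman_rho` are hypothetical. The sketch returns Welch's t statistic with Welch-Satterthwaite degrees of freedom, and Spearman's rho computed as the Pearson correlation of average ranks.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def average_ranks(values):
    """Ranks from 1 to n; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Toy values (hypothetical, for illustration only): D-2HG signal in
# microregions with and without at least one CD8+ T cell.
d2hg_with_cd8 = [1.0, 1.2, 0.9, 1.1]
d2hg_without_cd8 = [1.8, 2.1, 1.7, 2.0, 1.9]
t_stat, dof = welch_t(d2hg_with_cd8, d2hg_without_cd8)

# Toy values (hypothetical): D-2HG level versus CD8+ T cell count per region.
d2hg_level = [0.5, 1.0, 1.5, 2.0, 2.5]
cd8_count = [9, 7, 7, 3, 1]
rho = spearman_rho(d2hg_level, cd8_count)
```

A negative `t_stat` here means the toy regions containing CD8+ T cells have the lower mean signal, and a negative `rho` corresponds to an anticorrelation. Converting `t_stat` and `dof` into a P value needs a Student's t distribution, which is left out to keep the sketch dependency-free.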

To better understand the physiological role of D-2HG accumulation in the microenvironment of IDH-mutant tumors on CD8+ T cell function, we analyzed single-cell RNA-sequencing data from a different set of IDH-WT GBM patients (n = 16) and IDH-mutant glioma patients (n = 15) (38) and profiled the gene expression signature of tumor-infiltrating T cells. We identified the top differentially up-regulated genes in CD8+ and CD4+ T cell subpopulations and grouped these into hallmark gene signatures (Fig. 6P and fig. S6P). This showed that several of the top down-regulated genes in CD8+ T cells from IDH-mutant samples fall under the IFN signaling signature (Fig. 6Q).

Previous analysis of this single-cell RNA-sequencing dataset revealed six subclusters for CD8+ T cell subsets [cytotoxicity-natural killer (NK), IFN, effector memory, chemokine-IFN-γ, not determined, and stress cells] and CD4+ T cell subsets (cytotoxicity-chemokine, IFN, effector memory, Treg, memory, and stress cells) (38). Thus, we assessed the gene expression signature of the top 20 up-regulated genes in IDH-WT GBM within each of the previously identified subclusters. The top 20 up-regulated genes gave a weaker signature score in the cytotoxic, IFN, and IFN-γ subclusters for CD8+ T cells from IDH-mutant gliomas compared with IDH-WT GBM (Fig. 6, R to T). The expression of the top up-regulated genes was not affected in the Tregs and cytotoxicity subclusters for CD4+ T cells, but it was elevated in the CD4+ IFN subcluster for IDH-WT compared with IDH-mutant (fig. S6Q). Thus, we concluded that the cytotoxic and IFN programs of effector T cells, in particular effector CD8+ T cells, are lower in IDH-mutant samples compared with IDH-WT.

Finally, we validated the relevance of the ex vivo-derived IFN-γ signature from D-2HG-treated mouse CD8+ T cells (fig. S1G) using the human single-cell RNA-sequencing data. We generated a gene list containing the top differentially expressed (DE) ISGs upon D-2HG treatment from the mouse RNA-sequencing experiment and used this gene list to filter the top 400 DE genes in CD4+ and CD8+ T cell populations from IDH-mutant and IDH-WT tumors. Overall, both CD8+ and CD4+ T cell populations from IDH-mutant tumors showed decreased expression of the IFN-induced mouse gene signature (Fig. 6U), demonstrating that the human IFN-γ signature of CD8+ T cells derived from IDH-mutant tumors is similar to that of D-2HG-treated mouse CD8+ T cells.


Discussion 


Our results suggest that D-2HG may not only affect tumor initiation and growth through tumor cell-intrinsic mechanisms but also by directly affecting the cells in the surrounding TME. In the context of antitumor immunity, IDH-mutant tumors are surveyed by immune cells, which, upon infiltrating, face a newly D-2HG-rich microenvironment. IDH-mutant tumors have fewer tumor-infiltrating T cells compared with IDH-WT tumors (7, 13-16), but the factors that mediate the immunosuppressive functions of D-2HG have remained elusive. We propose that D-2HG acts directly on CD8+ T cells in the TME of IDH-mutant tumors by altering their metabolic and cytotoxic signatures.

Glucose metabolism has a central role in regulating cytotoxic T cell responses. As effector CD8+ T cells undergo extensive proliferation in response to antigen stimulation, maintaining a high glycolytic flux is needed to support the bioenergetic needs, as well as to provide the building blocks for cellular biomass (32). Furthermore, glucose metabolism has also been tightly linked to IFN-γ signaling and T cell cytotoxicity (30, 39-46). Our findings show that D-2HG alters glucose metabolism in CD8+ T cells, resulting in impaired proliferation, cytokine production, and cytotoxicity. Specifically, we show that decreased IFN-γ production in D-2HG-treated CD8+ T cells results in a suppressed autocrine IFN-γ gene expression signature, which has been shown to be important for effector function (47). Moreover, although a link between glucose metabolism and the expression of lytic molecules has previously been made (40), our work has uncovered a previously unknown role for LDH and D-2HG in regulating the process of degranulation. Thus, our studies expand upon previous work demonstrating the role of glycolytic metabolism in regulating T cell effector function and proliferation.

Analysis of the metabolic landscape of human gliomas has revealed altered lactate levels and decreased IFN-γ signatures in glioma samples from IDH-mutant patients. Mechanistically, we propose that D-2HG alters CD8+ T cell metabolism through direct LDH inhibition. Decreased LDH-A activity caused by epigenetic silencing in IDH-mutant tumors had previously been reported (48-51), but our work shows that D-2HG can additionally have a new, direct and immediate effect on LDH enzymatic activity in CD8+ T cells. Future studies are required to investigate the in vivo immunotherapeutic relevance of the proposed mechanism and determine whether D-2HG inhibition of LDH activity expands beyond CD8+ T cells in vivo, affecting other cells within the TME.

Finally, we found that many of the effects caused by D-2HG are reversible upon D-2HG removal. In the future, it will be interesting to determine whether therapeutic interventions involving US Food & Drug Administration-approved, mutation-specific IDH inhibitors potentiate immune responses by removing immunosuppressive D-2HG from the TME, in addition to having a direct role in the inhibition of tumor growth. In support of this idea, recent studies have shown that CD8+ T cell infiltration is restored in the presence of inhibitors of mutant IDH (52).
Earth’s composition was modified by collisional erosion


Paul Frossard, Claudine Israel, Audrey Bouvier, Maud Boyet


The samarium-146 (146Sm)-neodymium-142 (142Nd) short-lived decay system (half-life of 103 million years) is a powerful tracer of the early mantle-crust evolution of planetary bodies. However, an increased 142Nd/144Nd in modern terrestrial rocks relative to chondrite meteorites has been proposed to be caused by nucleosynthetic anomalies, obscuring early Earth’s differentiation history. We use stepwise dissolution of primitive chondrites to quantify nucleosynthetic contributions on the composition of chondrites. After correction for nucleosynthetic anomalies, Earth and the silicate parts of differentiated planetesimals contain resolved excesses of 142Nd relative to chondrites. We conclude that only collisional erosion of primordial crusts can explain such compositions. This process associated with planetary accretion must have produced substantial loss of incompatible elements, including long-term heat-producing elements such as uranium, thorium, and potassium.


Short-lived radioactive systems are used as high-resolution chronometers of early Solar System events. Notably, the 146Sm-142Nd pair provides important constraints on mantle-crust evolution of early Earth and its modern composition. In 2005, Boyet and Carlson (1) were the first to find evidence for an offset of 20 parts per million (ppm) in the 142Nd/144Nd ratio between terrestrial samples and chondrites that are usually considered to be building blocks of planets. They interpreted the excess of 142Nd in Earth as the result of an early differentiation event and subsequent burial and isolation of this early-formed crust, which left the mantle depleted in incompatible elements and, over time, enriched in 142Nd. Alternative scenarios such as collisional erosion of the early crust or accretion of Earth from material with a nonchondritic Sm/Nd ratio have been invoked to explain the observed excess (1-3).
Chronological information from short-lived radioactive systems is complicated by nucleosynthetic anomalies, which are mass-independent stable isotope variations that are inherited from incomplete mixing of the solar nebula [e.g., (4)]. Neodymium and Sm isotopes are produced by a mixture of s-, r-, and p-nucleosynthetic processes, which correspond to slow and rapid neutron capture and proton capture or photodisintegration of large nuclei, respectively. The contribution of the p-process to 142Nd is particularly ill-constrained because it cannot be identified by measuring the Nd isotope composition alone. Estimates range from 1 to 20% according to astrophysical models and cosmochemical constraints (5-11). p-Process variation is determined by combining Sm and Nd isotope measurements because 144Sm is a pure p-process nuclide (4, 8).

A radiogenic origin for the excess of 142Nd in Earth was recently argued against on the basis of evidence that Earth lies on the 146Sm-142Nd isochron of the first solids formed in the Solar System, the Ca-Al-rich inclusions (CAIs) (5), as well as on s-process mixing lines with chondrites (6). Earth shares closest affinities with the subgroup of EL3 enstatite chondrites (12). Nevertheless, the potential contribution of the p-process on 142Nd is poorly constrained [e.g., (7)]. Nucleosynthetic compositions identified in meteorites deviate from the astrophysical models of nucleosynthesis and may not be similar in all chondrite groups (8, 13).

In this work, we report a comprehensive set of coupled Nd and Sm isotope compositions of bulk rocks and of components of unequilibrated chondrites from both the noncarbonaceous (NC) (enstatite and ordinary chondrites) and the carbonaceous (CC) groups. Our objective was to identify the sources and variability of nucleosynthetic anomalies, including the p-process, that potentially affect the 142Nd/144Nd ratio in bulk meteorites. Mass-independent Nd and Sm isotope compositions, which were
Fig. 1. Coupled μ142Nd and μ144Sm and coupled μ142Nd and μ148Nd composition of leachates. (A) The regression line is calculated by using solely leachates from NC chondrites and leachates L8 to L13 of Orgueil to avoid the contribution of CAIs. μ142Nd compositions of leachates are corrected (corr.) for the decay of 146Sm (16). Leachates of NC chondrites do not deviate from the leachate regression line that represents the s-process, which evidences the homogeneous p-process contribution to Nd and the lack of pure p-process carriers in NC chondrites. CAI compositions are from (5, 6, 17, 18), which are compiled in (25). (B) Leachates are used to calculate a regression line that models the s-process trend in meteorites, which accurately relates the nucleosynthetic compositions of μ142Nd and μ148Nd. The dashed line represents the mixing line between Earth and the s-process estimates from astrophysical models (11). Errors are twice the standard error of a single measurement or twice the standard deviation of the standard processed with similar contents in the same analysis session, whichever is larger. Errors on the regression are 95% confidence intervals that were calculated by using IsoplotR (21).


normalized to fixed 146Nd/144Nd or 147Sm/152Sm ratios, are hereafter expressed as deviations from a synthetic terrestrial standard in ppm {$\mu^{i}X = [(^{i}X/^{j}X)_{\mathrm{sample}}/(^{i}X/^{j}X)_{\mathrm{standard}} - 1] \times 10^{6}$, where i and j are isotopes of the element X}. We used a recently improved mass spectrometric method (14) for Nd isotope analyses of eight bulk chondrites and some of the separated components of chondrites. Pooled chondrules and a single large chondrule (~200 mg) from the ordinary chondrite NWA 8007 (L3) and magnetically separated and sorted fragments of the EH3 chondrite Sahara 97158 were analyzed only for their Nd isotope compositions. Enstatite (Sahara 97158-EH3, PCA 91020-EL3), ordinary (NWA 8007-L3.2), and carbonaceous (Orgueil-CI) chondrites were sequentially dissolved with increasingly aggressive acids and analyzed for their Nd and Sm isotope compositions. This procedure produced several fractions (named L1 to L13), thereby isolating the most refractory minerals that include presolar grains with the most extreme isotopic compositions (8, 13, 15).
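
As a minimal sketch of the μ notation defined above, the deviation of a measured isotope ratio from the standard can be computed as below. The function name `mu_ppm` and the two ratio values are hypothetical and chosen only to show the arithmetic; they are not measurements from this study.

```python
def mu_ppm(ratio_sample, ratio_standard):
    """Deviation of an isotope ratio from the standard, in parts per million."""
    return (ratio_sample / ratio_standard - 1.0) * 1e6


# Hypothetical 142Nd/144Nd ratios, for illustration only.
standard_142_144 = 1.141840
sample_142_144 = 1.141820
mu142 = mu_ppm(sample_142_144, standard_142_144)
```

With these toy inputs the sample sits roughly 17.5 ppm below the standard, which shows why differences in the sixth decimal place of the ratio matter at the ppm level discussed in the text.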

Leachates from enstatite and carbonaceous chondrites display much larger nucleosynthetic anomalies than bulk chondrites in the range of tens to hundreds of ppm in magnitude (fig. S1 and table S1). The first leachates have positive μ145,148,150Nd anomalies, whereas those produced by the last and most aggressive dissolution steps (L6 and L7 for EH and EL and L8 to L13 pooled together for CI chondrites) have negative μ145,148,150Nd anomalies. Leachates L8 to L13 of the Orgueil CI chondrite reach extreme compositions, down to μ150Nd of -20,336 ± 47 (twice the standard error). By contrast, the leachates and chondrules of ordinary chondrites (OCs) and separates of the EH3 chondrite exhibit only small variations.
Corrections for radiogenic contributions of 146Sm are needed to interpret nucleosynthetic anomalies on μ142Nd. Because sequential dissolution may fractionate Sm and Nd (8), we calculated the corresponding 147Sm/144Nd ratios using the 143Nd/144Nd ratio of each sample (12) after correction for nucleosynthetic anomalies on the 143Nd/144Nd ratio (16).

Most leachates of NC chondrites display Sm isotope compositions that are indistinguishable from the terrestrial standard. Small deviations in all Sm isotope ratios are observed for leachates of CC chondrites. Variations in μ149Sm and μ150Sm reflect neutron capture reactions, but this effect is negligible for other
Nd and Sm isotope ratios (up to 1.9 ppm on 
u“°Nd; fig. S2) (16). The largest variations are 
obtained in L8 to L13 of the Orgueil meteorite, with a μ144Sm of -1797 ± 176 and a μ148Sm of 8176 ± 48 (twice the standard error) (fig. S2 and table S2).

Nucleosynthetic p-process contributions on μ142Nd are constrained by using μ144Sm in leachates (Fig. 1A). Deviations from the trend modeled by NC leachates and L8 to L13 of Orgueil are solely observed for leachates of CC chondrites and are related to signatures of CAIs that display large deficits in p-process 144Sm and small deficits in 142Nd (5, 6, 17, 18). However, no other p-process endmember is identified in leachates from all chondrite groups. s-Process excesses are indistinguishable from r-process deficits with Nd and Sm isotope compositions. We constrain endmembers using a mass balance calculation on leachates by using presolar SiC that is the only s-process-rich carrier of rare earth elements identified so far [e.g., (13, 16)] (fig. S3). Small amounts of presolar SiC are therefore enough to produce the range of Nd isotope compositions measured in leachates, without invoking an r-process carrier. The CAIs that bear fractionated and unknown nuclear isotope compositions (FUN CAIs) display large anomalies (19). They have, however, a negligible effect on the Nd and Sm isotope compositions of leachates and bulk rocks of CC chondrites because of their very rare occurrence in CV chondrites. We also confirm that Nd and Sm s-process compositions estimated by astrophysical models are substantially different compared to leachates (figs. S4 and S5), as pointed out by Saji et al. (20).

Given that Nd isotope composition in chondrites is solely related to s-process variations induced by variable amounts of presolar SiC, leachates can be used to define the nucleosynthetic trend by comparing a stable isotope ratio with μ142Nd (Fig. 1B). For consistency and after comparison with previous studies (8, 13, 20) (figs. S5 and S6), we use only leachates from this study to construct regression lines that define the nucleosynthetic variability in chondrites. We obtain slopes by total least squares regression of μ145Nd or μ148Nd on μ142Nd of -1.599 ± 0.007 and -0.862 ± 0.002 [95% confidence interval; (21)], respectively. However, the intercepts of the regression lines (theoretical Earth’s μ142Nd) are poorly constrained, owing to the low precision for many leachates with Nd isotope compositions close to the terrestrial value (fig. S7).
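
For readers unfamiliar with total least squares, the sketch below fits a straight line by minimizing perpendicular distances, using the closed-form slope for the unweighted case. It is an assumption-laden illustration: the function name `tls_line` is hypothetical, the points are synthetic values placed exactly on a line of slope -0.862 so the expected answer is known, and the fit ignores the per-point measurement uncertainties that a dedicated regression package would take into account.

```python
import math


def tls_line(x, y):
    """Unweighted total least squares (orthogonal) fit of y = intercept + slope * x.

    Requires a nonzero covariance between x and y.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    # Closed-form slope of the line that minimizes perpendicular distances.
    slope = (syy - sxx + math.sqrt((syy - sxx) ** 2 + 4 * sxy ** 2)) / (2 * sxy)
    return my - slope * mx, slope


# Synthetic points lying exactly on y = 2.0 - 0.862 * x (illustration only).
x = [-4.0, -1.0, 0.0, 2.0, 5.0]
y = [2.0 - 0.862 * xi for xi in x]
intercept, slope = tls_line(x, y)
```

Unlike ordinary least squares, this fit treats both axes symmetrically, which is the usual motivation for choosing it when both isotope ratios carry measurement error.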

Bulk chondrites provide the best samples to define accurately the Nd isotope compositions of Earth because of their much higher precision compared to leachates and their unfractionated Sm/Nd ratio. We critically assessed data from the literature to offer an up-to-date compilation of the highest precision measurements on meteorites (table S1) (6, 9, 22-25). The compilation gives weighted averages with two standard errors of μ142Nd = -9.9 ± 2.5, μ145Nd = 0.5 ± 1.5, μ148Nd = 2.2 ± 1.4, and μ150Nd = 8.0 ± 5.1 for enstatite chondrites (ECs) and μ142Nd = -13.9 ± 1.8, μ145Nd = 4.4 ± 1.7, μ148Nd = 7.6 ± 1.3, and μ150Nd = 14.9 ± 3.3 for OCs.
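
One common way to form such a weighted average is inverse-variance weighting, sketched below. Whether the compilation used exactly this weighting is an assumption; the function name `weighted_mean_2se` and the two input measurements are hypothetical and serve only to show the calculation.

```python
import math


def weighted_mean_2se(values, two_se):
    """Inverse-variance weighted mean and its uncertainty (two standard errors).

    `two_se` holds each measurement's uncertainty quoted as two standard errors.
    """
    weights = [1.0 / (u / 2.0) ** 2 for u in two_se]
    wsum = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / wsum
    return mean, 2.0 * math.sqrt(1.0 / wsum)


# Hypothetical mu142Nd measurements in ppm (illustration only).
values = [-10.0, -12.0]
two_se = [2.0, 4.0]
avg, avg_2se = weighted_mean_2se(values, two_se)
```

The more precise measurement dominates the result, so the average lands closer to -10 than to -12.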

We used slopes defined by leachate regression lines linking a stable isotope ratio to μ142Nd to calculate the μ142Nd composition of chondrites without nucleosynthetic anomalies relative to Earth. The theoretical composition of Earth, assuming NC chondrites are its main building blocks, is calculated in this manner. We corrected EC and OC 142Nd isotope compositions for nucleosynthetic anomalies using their measured μ145Nd and μ148Nd. We find very similar results, with an average of μ142Nd = -7.9 ± 1.9 (twice the standard deviation, n = 4) (Fig. 2A). This negative value of μ142Nd relative to Earth contrasts with μ145Nd and μ148Nd, which lie on the leachate regression line that passes through the NC chondrites (Fig. 2B). We did not consider CC chondrites because of their depletion in μ144Sm, which suggests the presence of p-process-depleted material, likely related to the presence of CAIs that can substantially alter the theoretical 142Nd composition of Earth (Fig. 1A and fig. S8). After correcting μ142Nd for nucleosynthetic anomalies, the corrected data for NC chondrites are resolved from Earth’s 142Nd composition by
using Student’s t tests, with p values [significance level (α) of 0.05] between 6.7 × 10^-7 and
4.8 x 10~*. Recent studies (9, 24, 25) reported 
high-precision Nd isotope compositions of achondrites by sampling early planetesimals. When corrected for nucleosynthetic anomalies, their μ142Nd values are between those of Earth and NC chondrites, and more specifically, angrites and eucrites are resolved from NC chondrites (p < 0.007).
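
The correction described in this paragraph can be reproduced approximately from the numbers quoted in the text. The sketch below is a reading of the method under stated assumptions, not the authors' code: it takes the two leachate slopes as the change in μ142Nd per unit of μ145Nd or μ148Nd, and it subtracts that nucleosynthetic contribution from the EC and OC group averages given earlier. The dictionary layout and the function name `corrected_mu142` are hypothetical.

```python
from statistics import mean, stdev

# Leachate regression slopes quoted in the text, read here (assumption) as
# d(mu142Nd)/d(mu145Nd) and d(mu142Nd)/d(mu148Nd).
SLOPE = {"mu145": -1.599, "mu148": -0.862}

# Weighted group averages quoted in the text, in ppm.
GROUPS = {
    "EC": {"mu142": -9.9, "mu145": 0.5, "mu148": 2.2},
    "OC": {"mu142": -13.9, "mu145": 4.4, "mu148": 7.6},
}


def corrected_mu142(group, proxy):
    """Remove the s-process contribution predicted from one stable Nd anomaly."""
    g = GROUPS[group]
    return g["mu142"] - SLOPE[proxy] * g[proxy]


# Two groups times two proxies gives the four corrected values (n = 4).
corrected = [corrected_mu142(g, p) for g in GROUPS for p in SLOPE]
average = mean(corrected)
two_sd = 2 * stdev(corrected)
```

Under these assumptions the four corrected values average to about -7.8 ppm with twice the standard deviation near 1.9 ppm, close to the -7.9 ± 1.9 reported above. The small difference is expected because the inputs here are rounded group averages.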

The cause of the excess in μ142Nd in Earth and achondrites may be related to several factors: (i) a p-process excess, (ii) accretion of material with a nonchondritic composition, or (iii) early silicate differentiation and isolation of the enriched reservoir. We did not identify any p-process-rich carriers in leachates of chondrites, and such a carrier is unlikely to be present only in Earth and achondrites. The lack of precision in μ144Sm of bulk chondrites does not allow for the elimination of this possibility entirely, but current datasets imply that a contribution of p-process of this magnitude is nevertheless unlikely. Chondrites, despite having been formed 2 million to 3 million years after the first planetesimals [e.g., (26)], are representative of the early Solar System composition before differentiation of planet-size objects. The most likely hypothesis is that the excess of 142Nd in Earth and achondrites reflects an early silicate differentiation of planets and planetesimals that produced mantles with superchondritic Sm/Nd ratios. Achondrite parent bodies accreted within 1.5 million years after the crystallization of CAI [e.g., (26)] and experienced global magma ocean stages. Early crustal material may have been buried and isolated in the mantle of large bodies during the overturn of high-density cumulates during the evolution of magma oceans [e.g., (27)]. However, the presence of hidden crustal reservoirs in the mantle of planetesimals is very unlikely because it would imply overturn mechanisms that are difficult to produce on planetesimals (27). The absence of achondrites displaying subchondritic Sm/Nd and Hf/Sm ratio variation (28), for instance, also corroborates this interpretation.
Fig. 2. s-Process nucleosynthetic contributions on Nd isotope compositions of bulk NC meteorites. Nd isotope compositions of meteorites have been compiled to include only the most precise and accurate data (table S3). Averages of meteorite groups are plotted, except for the three ungrouped achondrites.
(A) The w“?Nd-u!®Nd space shows 
the radiogenic excess of 142Nd in Earth and NC achondrites. The two lines represent the s-process nucleosynthetic trend with the slope determined with leachates of this
Destruction of the primordial crust through collisions or explosive volcanism (29) are the more suitable processes to explain the ubiquitous excess in 142Nd in both Earth and planetesimal samples (Fig. 3). We favor collisional erosion in light of the overwhelming evidence of common collisions in the early Solar System that led to the accretion of planets and the quantification of this process by numerical models [e.g., (30, 31)].

We estimate the amount of crust lost during 
collisional erosion using mass balance calcu- 
lations. Given the antiquity of the early plan- 
etesimals, silicate differentiation events most 
likely occurred very early (<1 million years ago 
for achondrite parent bodies). We assume that 
collisional erosion for both Earth and plane- 
tesimals occurred 4.567 billion years ago (32). 
Given Earth’s excess in “?Nd of 7.9 ppm com- 
pared with that of NC chondrites, we infer a 
47$m/™4Nd ratio of 0.2012 + 0.0051 (“Nd = 
3.3 + 4.4) for Earth’s modern mantle or early 
depleted reservoir, which is higher than the 
chondritic reference of 0.1960 + 0.0004 (33). 
We estimate the mass of crustal material that 
was removed during collisional erosion events 
for various crustal compositions (Fig. 4). Con- 
sidering crusts with a low Sm/Nd ratio and 
high Nd contents [lunar KREEP (potassium,
rare earth elements, and phosphorus), Earth’s
continental crust, and martian crust (34-36),
respectively], the mass balance suggests that up
to 5% of the total initial accreted mass needs to 
be removed. About 4 to 20% of the total initial 
mass is required to be eroded by using a crust 
with lower Sm/Nd ratio and Nd contents such 
as those of the lunar Mg suite (37) and of the 
achondrite GRA 06128/9 (38). The accretion of 
differentiated planetesimals as building blocks 
of Earth is supported by models of collisional 
erosion that estimate the mass eroded during 
accretion from ~10 to 20% of the total initial 
mass of all planetesimals [e.g., (31, 39)]. 
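
The mass balance described above can be sketched in a few lines of Python. This is a minimal two-reservoir illustration under stated assumptions, not the authors' calculation: it treats the ¹⁴⁷Sm/¹⁴⁴Nd ratio as a proxy for Sm/Nd, and the crustal ratio (0.12) and the Nd enrichment factor of the crust (20) are hypothetical placeholders. Only the chondritic ratio (0.1960) and the inferred mantle ratio (0.2012) come from the text.

```python
def residual_ratio(r_bulk, r_crust, nd_enrichment, f):
    """Sm/Nd-type ratio left in a body after a crust of mass fraction f is removed.

    The bulk body has unit mass and unit Nd concentration (normalized).
    The crust has Nd concentration `nd_enrichment` times the bulk value.
    """
    nd_removed = f * nd_enrichment      # share of the Nd budget carried away
    sm_removed = nd_removed * r_crust   # matching share of the Sm budget
    return (r_bulk - sm_removed) / (1.0 - nd_removed)


def eroded_crust_fraction(r_bulk, r_residual, r_crust, nd_enrichment):
    """Mass fraction of crust that must be removed to reach `r_residual`.

    Obtained by solving residual_ratio(...) = r_residual for f.
    """
    if not (r_crust < r_bulk < r_residual):
        raise ValueError("expected r_crust < r_bulk < r_residual")
    if nd_enrichment <= 0:
        raise ValueError("nd_enrichment must be positive")
    return (r_residual - r_bulk) / (nd_enrichment * (r_residual - r_crust))


if __name__ == "__main__":
    R_CHONDRITIC = 0.1960   # chondritic reference, from the text
    R_MANTLE = 0.2012       # inferred for Earth's mantle, from the text
    R_CRUST = 0.12          # hypothetical low-Sm/Nd crust
    ND_ENRICHMENT = 20.0    # hypothetical crust/bulk Nd concentration ratio

    f = eroded_crust_fraction(R_CHONDRITIC, R_MANTLE, R_CRUST, ND_ENRICHMENT)
    print(f"eroded crust fraction: {100 * f:.2f}% of the initial mass")
```

The result scales inversely with the assumed Nd enrichment, which is why crusts with lower Nd contents require a larger eroded mass in the estimates above.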

The ubiquity of collisional erosion recorded 
by Nd isotope compositions suggests substan- 
tial consequences for the highly incompatible 
elemental compositions of Earth and plane- 
tesimals. In this regard, K, Th, and U are critical 
as radioactive heat-producing elements [e.g., 
(40)]. We are limited in estimating the deple- 
tion in K, Th, and U in primordial crusts owing 
to the lack of complete meteorite igneous suites 
from various parent bodies. K, Th, and U are 
more incompatible than rare earth elements, 
and the amount lost during collisional erosion 
is estimated to be greater than 30%. 

Chondrites have long been considered to be
the building blocks of Earth. However, growing
Fig. 3. Mass proportion of crust eroded by collisions to produce Earth’s μ¹⁴²Nd.
The composition of a hypothetical crust is calculated by mass balance by using a
starting composition at -7.9 ppm (NC chondrites) and various types of crustal
materials. The red area corresponds to compositions with a low Sm/Nd ratio
(<0.285) and high Nd content represented by lunar KREEP (34), Earth's continental
crust (35), and the martian crust from the basaltic breccia NWA 7034 (36). The
blue area corresponds to more mafic compositions with a higher Sm/Nd ratio
(>0.285) and lower Nd content, which is represented by the lunar Mg suite (37) and
the alkali- and silica-rich crust of a planetesimal with GRA 06128/9 (38). Lunar
ferroan anorthosites (42) are also represented as a black line. M_eroded/M_0, the
mass proportion of crust eroded from the total accreted mass.
Fig. 4. Sketch of the collisional erosion scenario on planetesimals and Earth. Collisions between
differentiated planetesimals strip off protocrusts, which leaves bodies with a mantle that is enriched in Sm
over Nd compared with undifferentiated chondrites [Sm/Nd_CHUR = 0.311, where CHUR stands for chondritic
uniform reservoir; (33)]. Earth accreted from collisionally eroded planetesimals is depleted in incompatible
elements and displays a superchondritic Sm/Nd ratio.
CYTOSKELETON

Actin maturation requires the
ACTMAP/C19orf54 protease

Peter Haahr, Ricardo A. Galli, Lisa G. van den Hengel, Onno B. Bleijerveld,
Justina Kazokaité-Adomaitiené, Ji-Ying Song, Lona J. Kroese, Paul Krimpenfort,
Marijke P. Baltissen, Michiel Vermeulen, Coen A. C. Ottenheijm, Thijn R. Brummelkamp


Protein synthesis generally starts with a methionine that is removed during translation. However,
cytoplasmic actin defies this rule because its synthesis involves noncanonical excision of the acetylated
methionine by an unidentified enzyme after translation. Here, we identified C19orf54, named ACTMAP
(actin maturation protease), as this enzyme. Its ablation resulted in viable mice in which the
cytoskeleton was composed of immature actin molecules across all tissues. However, in skeletal
muscle, the lengths of sarcomeric actin filaments were shorter, muscle function was decreased, and
centralized nuclei, a common hallmark of myopathies, progressively accumulated. Thus, ACTMAP
encodes the missing factor required for the synthesis of mature actin and regulates specific
actin-dependent traits in vivo.


The actin cytoskeleton organizes cellular
architecture, force generation, and cell
migration (1). Humans express six highly
abundant actin isoforms: two cytoplasmic
actins and four muscle actins. The biosynthesis
of all actin molecules requires
N-terminal processing referred to as actin
maturation (2-6). For cytoplasmic β- and
γ-actin molecules, the initiating acetyl methionine
is removed after translation by an unknown
enzyme (Fig. 1A). For muscle actin, the
initiator methionine is canonically removed,
and the exposed cysteine is acetylated and
then trimmed by an unknown enzyme (fig. S1)
(5, 6). The trimmed N terminus is then reacetylated
by NAA80, generating mature actin (7-9).
However, 40 years after the observation of
N-terminal actin processing (3), we still lack
a clear understanding of this pathway.

To study actin maturation, we generated an
antibody that recognizes the immature N terminus
of β-actin (Fig. 1B). In the absence of a
specific antibody for mature β-actin, an antibody
for mature γ-actin was used (Fig. 1B).
Having previously applied haploid genetics to
study posttranslational modifications (10-12),
we performed two complementary haploid
screens to find regulators of actin maturation.
Mutagenized cells were stained with actin
antibodies, and cell populations with relatively
high or low antibody staining were isolated
(Fig. 1C) (10). We looked for genes that were
enriched for disruptive gene-trap mutations in
the cell population with high immature actin
staining (negative regulators; data S1) and
genes that were enriched for mutations in
the cells displaying low levels of mature actin
(positive regulators; data S2). C19orf54 fulfilled
these parameters and scored as the top
hit (Fig. 1, D to F, and fig. S2A). In contrast to
other hits, C19orf54 has never been identified
in screens unrelated to actin processing, suggesting
specificity for this process (fig. S2B)
(10-12). We renamed this factor ACTMAP
(actin maturation protease) for reasons described
below.

Ablation of ACTMAP resulted in undetectable 
actin maturation and accumulation of imma- 
ture actin (Fig. 1, G to I). Moreover, we ob- 
served no effect on the total levels of actin 
using a pan-actin antibody raised against the 
C-terminal region of actin (Fig. 1, H and I). 
Additionally, no overt changes were detected 
in cytoskeletal organization (fig. S3), G-actin/
F-actin ratios (fig. S4, A and B), F-actin 
staining intensities (figs. S4C and S5, A to E), 
filopodia formation (fig. S6, A to F), cell size 
(figs. S5, C to E, and S7, A and B), or cell moti- 
lity (figs. S8, A and B, and S9, A to F). As an 
independent approach to antibody detection, 
we used bottom-up proteomics to study the 
actin N terminus. We found that N-terminal 
actin peptides in HAP1 cells were almost exclusively
detected in the mature state (Fig. 1,
J and K, and fig. S10), as has been reported
previously (13, 14). Conversely, loss of ACTMAP
resulted in actin molecules exclusively detected 
in their immature acetylated and methionated 
state (Fig. 1, J and K, and fig. S10). As expected,
NAA80-knockout cells exclusively contained
the nonacetylated form of mature actin (Fig. 
1, J and K, and fig. S10). Thus, ACTMAP is 
critical for actin maturation and stimulates 
the excision step of acetylated methionine from 
cytoplasmic actin. 

Because ACTMAP was an uncharacterized 
protein, we analyzed the structure predicted 
in the AlphaFold Protein Structure Database 
(15). Its C-terminal domain (UPF0692, domain 
of unknown protein function) and proline-rich 
N-terminal region were predicted to have a 
globular fold and to be disordered, respectively 
(Fig. 2, A and B, and fig. S11A). Furthermore, 
protein structure comparisons (16) against experimentally
determined structures in the Protein
Data Bank (PDB) found that the UPF0692
domain was similar in structure to bacterial
cysteine proteases such as the putative C39-like
peptidase (PDB identifier: 3ERV; Dali z score:
15.5; root mean square deviation: 2.3 Å) despite
only sharing 12% sequence identity (fig. S11,
B and C). A conserved cysteine (Cys¹³²) and
histidine (His”**) in ACTMAP were positioned 
in a potential catalytic dyad, as observed in 
cysteine peptidases (Fig. 2, A and B, and fig. 
S11B). To determine whether ACTMAP is in- 
deed a cysteine protease, we complemented 
ACTMAP-knockout cells with wild-type (WT) 
and mutant ACTMAP transgenes. WT ACTMAP 
fully restored actin processing, whereas a point 
mutation affecting the cysteine in the predicted 
catalytic site (C132A) prevented this (Fig. 2C 
and fig. S12, A and B). Thus, ACTMAP cysteine 
protease activity is essential for actin maturation. 

Next, we generated recombinant, full-length 
ACTMAP WT and C132A protein to test the 
enzymatic activity in vitro (Fig. 2D and fig. S13, 
A and B). Recombinant ACTMAP efficiently 
catalyzed N-terminal cleavage of immature 
actin in cellular extracts (Fig. 2E and fig. S14A). 
Purified green fluorescent protein-tagged
β-actin was also processed by recombinant
ACTMAP, albeit less efficiently (fig. S14, B
and C). This activity required the catalytic site
of ACTMAP, because C132A ACTMAP was
inactive (Fig. 2E and fig. S14, A and C). The
structural model of ACTMAP showed that
Cys¹³² was accessible through a pocket lined
with conserved, positively charged residues
that, in contrast to METAP1/2 (17), could accommodate
negatively charged N termini
such as actin (fig. S15A). Moreover, a ColabFold
model (18) of the plausible ACTMAP:β-actin
complex placed the methionated β-actin
N terminus within the positively charged
catalytic pocket of ACTMAP (fig. S15B). On
the basis of this prediction, we could validate
binding between actin and ACTMAP in vitro
(fig. S15C).

Fig. 2. ACTMAP cysteine protease activity specifically cleaves actin
N termini. (A) Predicted catalytic residues Cys¹³² and His?** (red) in human
ACTMAP. (B) AlphaFold model (ACTMAP residues 71 to 351) with predicted
catalytic dyad (bottom) in red. (C) Actin processing in ACTMAP-knockout
cells expressing hemagglutinin (HA)-tagged WT or C132A ACTMAP.
(D) Coomassie gel of recombinant ACTMAP. (E) Actin processing in
ACTMAP-knockout lysates incubated with recombinant WT or C132A ACTMAP
for 30 min. (F) Abundance of robustly detected N termini without initiator
methionine in WT cells (317 proteins, x axis) and ACTMAP-knockout cells
(y axis) quantified by label-free mass spectrometry (LFQ) (data are shown as
mean ± SD; n = 3 clones). (G) HAP1 (CTRL) and ACTMAP knockouts
expressing HA-tagged ACTMAP subjected to HA pulldown and analyzed by
mass spectrometry. Plot shows mean log-fold change (FC) from triplicates
plotted against the −log10(FDR) permutation-based t test. Dashed lines
represent FDR t test P < 0.05 and log2(FC) > 4. Significantly enriched
proteins are labeled in addition to ACTG1 and PFN1. (H) Affinity of PFNs
(x axis) for fluorescent ACTMAP. Fnorm (y axis) denotes change in fluorescence
measured by microscale thermophoresis. (I) Mean dissociation constant
(Kd) from (H). PFN2A, Kd = 0.12 ± 0.03 μM; PFN2B, Kd = 0.28 ± 0.09 μM;
PFN1, Kd = 9.2 ± 3.10 μM. (J) Mean Kd values as in (I) to (H) comparing PFN2A
binding with ACTMAP FL (Kd = 0.12 ± 0.03 μM) and ΔN (Kd = 0.39 ± 0.06 μM).
(K) As in (E) comparing ACTMAP FL with residues 88 to 351 (ΔN).

To address the specificity of ACTMAP cleavage
activity for actin, we analyzed our whole-cell
proteomics dataset of WT and ACTMAP-knockout
cells. ACTMAP loss selectively affected
actin across all robustly detected protein
N termini with methionine removal (Fig. 2F and
data S3). This specificity for actin was not
solely determined by the N-terminal sequence,
because other proteins with N-terminal sequences
similar to that of actin (M-[DE]-[DE]-[DE])
retained their initiator methionine in cells (fig.
S15D) and thus are not ACTMAP substrates.
The interactome of ACTMAP was enriched
for actin and actin-associated proteins, supporting
the idea that ACTMAP specifically
targets actin (Fig. 2G and data S4). The monomeric
actin-binding protein profilin-2 (PFN2)
was about fivefold enriched compared with
its paralog PFN1, which is ~10-fold more


30 SEPTEMBER 2022 + VOL 377 ISSUE 6614 1535 


RESEARCH | REPORTS 

Fig. 3. ACTMAP is required for cytoplasmic and muscle actin maturation in mice. (A) Indicated tissues and organs collected from WT (+/+) or
Actmap-deficient (−/−) mice were subjected to immunoblot analysis to monitor actin processing. (B) Immunohistochemistry of immature β-actin
staining in liver sections of the indicated genotypes. Scale bar, 50 μm. (C)

of total) from mouse muscle tissue analyzed by LFQ. Data are shown as means ± SEM (n = 3 mice).


Fig. 4. ACTMAP is required for muscle strength. (A and B) Grip strength of male (A)
and female (B) mice normalized to their bodyweight. n = 4 independent experiments
(color coded) with ≥6 mice per condition. Data are shown as means ± SEM. *P < 0.05,
unpaired t test. (C) Schematic diagram of ex vivo muscle force measurement (left) and
quantification (right) presented as percentage force loss. Data are shown as
means ± SEM (n = 6 mice). *P < 0.05, unpaired t test. (D) Isolated stretched
soleus muscles were cryopreserved, longitudinal sections were stained with phalloidin,
and the length of sarcomeric F-actin filaments was analyzed by stimulated emission
depletion microscopy. Shown are examples of phalloidin staining (top) and
quantification of F-actin filament length (bottom). Black bar indicates mean (n >
85 sarcomeres and n = 3 or 4 mice per condition; ***P < 0.001, Welch's unpaired t test).
(E) Hematoxylin and eosin stain of soleus muscle in 50-week-old mice. White arrows
point to centralized nuclei. Scale bar, 100 μm. (F) Quanti-
abundant in HAP1 cells (19). NAA80 also
preferentially associates with PFN2 over
PFN1. This interaction stimulates N-terminal
acetylation of actin by NAA80 in vitro, although
loss of PFN2 did not affect steady-state
actin maturation in HAP1 cells (fig. S16A)
(7, 19).
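
The actin-like N-terminal motif M-[DE]-[DE]-[DE] mentioned in this paragraph maps directly onto a regular expression. The sketch below is only an illustration of how such a motif scan might look. The helper name is hypothetical, the first two test sequences are the N termini of human β- and γ-actin, and the third is an arbitrary non-matching example.

```python
import re

# M followed by three acidic residues (Asp or Glu), anchored at the N terminus.
ACTIN_LIKE_N_TERMINUS = re.compile(r"^M[DE]{3}")


def has_actin_like_n_terminus(sequence: str) -> bool:
    """Return True if a protein sequence starts with M-[DE]-[DE]-[DE]."""
    return bool(ACTIN_LIKE_N_TERMINUS.match(sequence.strip().upper()))


if __name__ == "__main__":
    examples = {
        "beta-actin": "MDDDIAALVVDNGSG",
        "gamma-actin": "MEEEIAALVIDNGSG",
        "arbitrary non-matching peptide": "MSTNPKPQRK",
    }
    for name, seq in examples.items():
        print(name, has_actin_like_n_terminus(seq))
```

As the text notes, matching this motif is not sufficient for ACTMAP processing, so a scan like this would only nominate candidates for experimental follow-up.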

To study ACTMAP-PFN interactions, we purified
recombinant PFN2A, PFN2B, and PFN1
(fig. S16, B and C). Binding affinities measured
in solution by microscale thermophoresis revealed
that ACTMAP bound all PFNs but
preferred PFN2 isoforms, with a 50- to 100-fold
lower dissociation constant (Kd) compared
with PFN1 (Fig. 2, H and I). The disordered
N-terminal part of ACTMAP contains polyproline
regions (Fig. 2A), which are a common
PFN-binding motif (20). Deletion of the proline-rich
ACTMAP N-terminal region (ΔN, amino
acid residues 88 to 351) decreased binding to
PFN2A and PFN2B by ~70% (Fig. 2J and fig.
S16D). Recombinant ACTMAP ΔN protein also
processed immature actin about sixfold less
efficiently than full-length ACTMAP in protein
extracts of ACTMAP-knockout cells (Fig. 2K).
Consistently, inducible reconstitution of exogenous
ACTMAP in ACTMAP-deficient cells confirmed
that ΔN was less efficient in processing
immature actin compared with full-length
ACTMAP (fig. S16, E and F). Thus, the proline-rich
ACTMAP N terminus enhances PFN2 binding
and facilitates efficient actin processing.
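
The affinity comparisons in this paragraph are simple ratios of dissociation constants. The sketch below uses the mean Kd values given in the Fig. 2 caption. Treating affinity as 1/Kd, and computing the loss of binding as 1 − Kd(full length)/Kd(ΔN), is an assumption about how such a percentage could be derived, not a statement of the authors' method.

```python
# Mean dissociation constants in micromolar, as listed in the Fig. 2 caption.
KD_FULL_LENGTH_UM = {"PFN2A": 0.12, "PFN2B": 0.28, "PFN1": 9.2}
KD_DELTA_N_PFN2A_UM = 0.39


def fold_tighter(kd_reference: float, kd: float) -> float:
    """How many times lower `kd` is than `kd_reference` (lower Kd = tighter)."""
    return kd_reference / kd


def fractional_affinity_loss(kd_before: float, kd_after: float) -> float:
    """Fractional drop in affinity (taken as 1/Kd) when Kd rises."""
    return 1.0 - kd_before / kd_after


if __name__ == "__main__":
    pfn1 = KD_FULL_LENGTH_UM["PFN1"]
    for isoform in ("PFN2A", "PFN2B"):
        ratio = fold_tighter(pfn1, KD_FULL_LENGTH_UM[isoform])
        print(f"{isoform} binds about {ratio:.0f}-fold tighter than PFN1")

    loss = fractional_affinity_loss(KD_FULL_LENGTH_UM["PFN2A"], KD_DELTA_N_PFN2A_UM)
    print(f"PFN2A affinity loss for the N-terminal deletion: {100 * loss:.0f}%")
```

Because these are ratios of mean values, they ignore the reported uncertainties and should be read as rough magnitudes only.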

To address the physiological role of ACTMAP, 
we disrupted Actmap (Mus musculus gene 
BC024978) in mice (fig. S17A). Homozygous 
Actmap-knockout mice were viable and born 
at the expected Mendelian ratio (fig. S17B; P = 
0.97, chi-square test; n = 200 mice) and displayed
no obvious genotype-specific morbidity
during their first year of life despite the ab- 
sence of cytoplasmic actin processing across all 
inspected tissues (Fig. 3, A and B, and fig. S17C). 
Processing of muscle actin involves a different 
route that requires excision of an N-terminal 
acetylated cysteine, likewise by an unknown 
enzyme (fig. S1). To determine whether ACTMAP
is also required for muscle actin maturation, 
we performed quantitative mass spectrometry 
on mouse muscle tissue. We found that the 
N termini of α-skeletal (ACTA1), α-cardiac
(ACTC1), and α-smooth (ACTA2) actin completely
retained their (immature) acetyl cysteine
in an Actmap-deficient background
(Fig. 3C). Thus, in vivo, ACTMAP is the sole 
enzyme responsible for removing the acety- 
lated methionine and cysteine of cytoplasmic 
and muscle actin, respectively, and mice are 
viable despite a cytoskeleton composed of im- 
mature actin molecules. 
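
The Mendelian-ratio check reported above is a chi-square goodness-of-fit test against a 1:2:1 expectation. The sketch below shows the arithmetic with hypothetical genotype counts (51 : 99 : 50, chosen only so that they sum to 200); the actual counts are in fig. S17B and are not reproduced here. With three genotype classes there are two degrees of freedom, for which the chi-square tail probability is exactly exp(−χ²/2), so no statistics library is needed.

```python
import math


def mendelian_chi_square(observed, expected_ratio=(1, 2, 1)):
    """Chi-square goodness-of-fit for three genotype classes (2 degrees of freedom).

    Returns (chi2, p_value). The closed-form p-value exp(-chi2 / 2) is valid
    only for 2 degrees of freedom, hence the length check.
    """
    if len(observed) != 3 or len(expected_ratio) != 3:
        raise ValueError("this sketch handles exactly three genotype classes")
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return chi2, math.exp(-chi2 / 2.0)


if __name__ == "__main__":
    # Hypothetical counts of +/+, +/-, and -/- pups; NOT the published data.
    chi2, p = mendelian_chi_square((51, 99, 50))
    print(f"chi2 = {chi2:.3f}, P = {p:.3f}")
```

A P value close to 1, as in the reported result, means the observed genotype counts are very close to the expected ratio.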

We found that loss of Actmap was asso- 
ciated with decreased muscle strength by 
~12% and ~11% in male and female mice, re- 
spectively, as measured by grip strength 
(Fig. 4, A and B). To directly investigate
single-muscle mechanics, we measured the ex vivo
contractile parameters of isolated intact soleus 
muscles (21). Soleus muscles from Actmap- 
knockout mice generated ~11% lower absolute 
maximal force compared with those of WT 
littermates (Fig. 4C). Actin, the main protein 
in sarcomeric thin filaments in muscle, is 
important for generating force and move- 
ment by interacting with myosin in the thick 
filaments. To determine whether Actmap 
loss affects thin filaments, we measured thin 
filament length in longitudinal sections of 
soleus muscle (22). Thin filaments in Actmap- 
knockout mice were on average ~8.4% shorter 
compared with WT (Fig. 4D). We next exam- 
ined muscle morphology and observed an 
age-dependent, eightfold increase in cen- 
trally located nuclei within muscle fibers of 
50-week-old Actmap-knockout mice compared 
with WT mice (Fig. 4, E and F). Centrally lo- 
cated nuclei in muscle fibers are a hallmark 
of ongoing muscle repair after injury and are 
often observed in progressive muscle patholo- 
gies (23). Thus, Actmap loss affects muscle 
strength, thin filament length, and the accu- 
mulation of centrally located nuclei. However, 
the current studies do not establish a causal 
relationship between these observations, and 
we cannot rule out a contribution of ACTMAP- 
binding proteins (such as PFNs) to these 
phenotypes. 

This study identifies ACTMAP as the missing 
enzyme for cytoplasmic and muscle actin bio- 
synthesis, demonstrating that it functions as 
a cysteine protease that specifically cleaves 
the actin N terminus but not the termini of 
other analyzed proteins. Unexpectedly, Actmap- 
knockout mice are viable despite the lack of 
actin N-terminal processing, although this comes 
at the cost of decreased muscle strength. It will 
be of interest to study whether the Drosophila 
homolog of ACTMAP is involved in the pheno- 
type of a previously described flightless mu- 
tant displaying defective actin processing 
(24, 25). Considering that ACTMAP is con- 
served across all species known to modify 
actin N termini (fig. S18), this factor most 
likely is the universal enzyme for the genera- 
tion of mature actin. 
SUPERCONDUCTIVITY

Promotion of superconductivity in magic-angle
graphene multilayers

Yiran Zhang, Robert Polski, Cyprian Lewandowski, Alex Thomson, Yang Peng,
Youngjoon Choi, Hyunjin Kim, Kenji Watanabe, Takashi Taniguchi, Jason Alicea,
Felix von Oppen, Gil Refael, Stevan Nadj-Perge



Graphene moiré superlattices show an abundance of correlated insulating, topological, and
superconducting phases. Whereas the origins of strong correlations and nontrivial topology can be
directly linked to flat bands, the nature of superconductivity remains enigmatic. We demonstrate that
magic-angle devices made of twisted tri-, quadri-, and pentalayer graphene placed on monolayer
tungsten diselenide exhibit flavor polarization and superconductivity. We also observe insulating
states in the tri- and quadrilayer arising at finite electric displacement fields. As the number of layers
increases, superconductivity emerges over an enhanced filling-factor range, and in the pentalayer
it extends well beyond the filling of four electrons per moiré unit cell. Our results highlight the role of
the interplay between flat and more dispersive bands in extending superconducting regions in
graphene moiré superlattices.


Rich phase diagrams of quantum electronic
phases have been realized in many
graphene superlattice structures, but so
far robust superconductivity has been
exclusive to twisted bilayer graphene
(TBG) (1, 2) and twisted trilayer graphene
(TTG) (3, 4). Notably, TTG exhibits greater
electric field tunability (3, 4), Pauli limit violation
(5), and more strongly coupled superconductivity
(3, 4) in comparison to TBG. Although
these observed differences may serve as clues
for identifying the superconducting origin of
these systems, our ability to identify the truly
universal features is ultimately limited by a lack
of robust superconducting moiré materials, suggesting
that further progress lies in the discovery
of other moiré superconducting systems.

We investigate twisted graphene multilayers
in which each successive layer is twisted by an
angle ±θ relative to the previous one in an
alternating sequence (Fig. 1A). For an even number
n of layers, the spectrum at zero displacement
field D is expected to separate into n/2 independent
TBG-like bands, each characterized by
a different effective twist angle. When the number
of layers n is odd, in addition to (n − 1)/2
TBG-like bands, one monolayer graphene (MLG)-like
band (essentially a Dirac cone) is expected
(6) (see left column of Fig. 1B for examples
when n is 3, 4, and 5). The system may be
conveniently modified through the application of
a displacement field D, which controllably
hybridizes the different bands (Fig. 1B, right
column). Experimentally, we explore the properties
of alternating twisted tri-, quadri-, and pentalayer
graphene (TTG, TQG, TPG, respectively)
structures with θ = 1.52 ± 0.02° (device D1,
trilayer), θ = 1.80 ± 0.04° (D2, quadrilayer), and
θ = 1.82 ± 0.05° (D3, pentalayer), respectively
(7). These twist angles all lie close to the
theoretically predicted “magic” values needed to
obtain one set of flat TBG-like bands
[$\theta^{M}_{\mathrm{TTG}} = \sqrt{2}\,\theta^{M}_{\mathrm{TBG}} \approx 1.53^\circ$,
$\theta^{M}_{\mathrm{TQG}} = \frac{\sqrt{5}+1}{2}\,\theta^{M}_{\mathrm{TBG}} \approx 1.75^\circ$, and
$\theta^{M}_{\mathrm{TPG}} = \sqrt{3}\,\theta^{M}_{\mathrm{TBG}} \approx 1.87^\circ$, assuming
an effective TBG twist angle $\theta^{M}_{\mathrm{TBG}} = 1.08^\circ$
(6); see section 4.1 in (7)]. We find that TTG,
TQG, and TPG all exhibit hallmark signatures
of strong correlations (Fig. 1, C to E), including
robust superconductivity and flavor symmetry
breaking as revealed by pronounced resistance
peaks around certain integer filling factors v
[number of electrons per moiré site; see section
1 in (7) for assignment of v].
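
The three magic-angle values quoted above follow one pattern: the prefactors √2, (√5 + 1)/2, and √3 are 2cos(π/(n + 1)) for n = 3, 4, and 5 layers. The short sketch below reproduces the quoted numbers under that closed form. The function name is hypothetical, and the 1.08° TBG magic angle is the effective value assumed in the text.

```python
import math

THETA_TBG_MAGIC_DEG = 1.08  # effective TBG magic angle assumed in the text


def magic_angle_deg(n_layers: int, theta_tbg_deg: float = THETA_TBG_MAGIC_DEG) -> float:
    """Largest magic angle of an alternating-twist stack of n_layers graphene sheets.

    Uses the prefactor 2*cos(pi/(n+1)), which equals 1, sqrt(2),
    the golden ratio, and sqrt(3) for n = 2, 3, 4, and 5.
    """
    if n_layers < 2:
        raise ValueError("need at least two layers to define a twist")
    return 2.0 * math.cos(math.pi / (n_layers + 1)) * theta_tbg_deg


if __name__ == "__main__":
    for n, label in ((2, "TBG"), (3, "TTG"), (4, "TQG"), (5, "TPG")):
        print(f"{label}: {magic_angle_deg(n):.2f} deg")
```

Comparing these values with the measured twist angles (1.52°, 1.80°, and 1.82°) shows how close each device sits to its predicted magic angle.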

In addition to the symmetry-breaking transitions
previously reported in TTG (3, 4, 8), our
TTG structure [coupled to a tungsten diselenide
(WSe₂) monolayer] (9) exhibits a previously
unobserved correlated insulating state
near v = +2 at finite D [Fig. 1C, inset; see also
fig. S4, Fig. 1B right column, and sections 3
and 4 in (7)], which originates from the interplay
between an interaction-driven cascade
transition and hybridization induced by
the D field [e.g., as captured by (10, 11)]. We
have also detected an insulating state
developing at finite D fields in TQG near
charge neutrality [Figs. 1D (inset) and 1G].
However, in contrast to TTG, the TQG insulating
state can be explained through the
D-induced hybridization only. Notably, the
detection of insulating gaps in TTG and TQG
implies a low level of disorder in our samples
(see also fig. S1).

The superconducting regions in all three 
structures extend over considerably larger filling 
factor ranges compared with TBG (2, 9, 12, 13) 
where superconductivity is typically observed 
within 2 < |v| < 3. Moreover, superconductiv- 
ity emerges over successively broader regions 
of phase space, reaching v ~ +5 on the electron 
side for TPG (Fig. 1, C to E). Along with a zero 
longitudinal resistance $R_{xx}$ observed in the
characteristic v versus T dome (Fig. 1, H to J), 
we also measure large critical currents (~400 nA) 
and occasionally see oscillations in critical cur- 
rent caused by superconducting interference, 
substantiating the robustness of phase coher- 
ence (7). Moreover, we measure high critical 
perpendicular magnetic fields $B_c$ (typically
~0.8 T; see fig. S3), indicating that the corresponding
Ginzburg-Landau coherence lengths
$\xi_{\mathrm{GL}}$ (~10 to 30 nm) are substantially smaller
than those observed in TBG and deviate from
the weak-coupling prediction, $\xi_{\mathrm{GL}} \approx \hbar v_F/(\pi\Delta)$
with $\Delta = 1.76\,k_B T_c$; this suggests a strong-coupling
origin of superconductivity (3, 4) [see
section 2 in (7)]. When combined with other 
recent experiments (5, 14, 15), these observations 
affirm the unconventional nature of super- 
conductivity within the entire class of graphene 
moiré systems. Further, the measurements on 
three to five layers indicate that the addition 
of layers promotes superconductivity over a 
broader filling window despite the coexist- 
ing dispersive bands as well as the ostensibly 
increased vulnerability to disorder—both from 
the additional twist angles and from the sen- 
sitivity to the relative displacement between 
layers (fig. S14). 
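
The link between the critical field and the coherence length quoted above can be checked with the standard Ginzburg-Landau relation B_c = Φ₀/(2πξ²), where Φ₀ = h/2e is the superconducting flux quantum. The sketch below applies it to the typical 0.8 T field from the text. Using the relation in this zero-temperature form is an assumption; the authors' own extraction procedure is described in their supplementary material and may differ.

```python
import math

FLUX_QUANTUM_WB = 2.067833848e-15  # superconducting flux quantum h/(2e), in webers


def gl_coherence_length_nm(critical_field_tesla: float) -> float:
    """Ginzburg-Landau coherence length from a perpendicular critical field.

    Inverts B_c = Phi_0 / (2 * pi * xi**2) and returns xi in nanometers.
    """
    if critical_field_tesla <= 0:
        raise ValueError("critical field must be positive")
    xi_m = math.sqrt(FLUX_QUANTUM_WB / (2.0 * math.pi * critical_field_tesla))
    return xi_m * 1e9


if __name__ == "__main__":
    print(f"xi_GL at 0.8 T: {gl_coherence_length_nm(0.8):.1f} nm")
```

A field of 0.8 T gives a coherence length of about 20 nm, inside the 10 to 30 nm range stated in the text.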

In addition to the pronounced v-dependence, 
the observed superconducting pockets are 
highly tunable with electric displacement field 
D (Fig. 2). A comparison of the three structures 
reveals, however, that TQG and TPG are more 
tunable than TTG. This is apparent both in the 
D-dependent evolution of the filling range in 
which superconductivity is measured (Fig. 2, A 
to C) as well as in the critical temperature $T_c$
(Fig. 2, D to F). Notably, superconductivity in
TQG and TPG is fully quenched for all fillings
at $D/\varepsilon_0 = 0.75$ V nm$^{-1}$ and $D/\varepsilon_0 = 0.6$ V nm$^{-1}$,
respectively. In the case of TTG, however, superconductivity
is present up to the maximum
accessible electric field $D/\varepsilon_0 = 1$ V nm$^{-1}$. Nevertheless,
$R_{xx}$ versus D and temperature measurements
do show that superconductivity is
suppressed for large D fields at optimal doping
(the doping at which superconducting $T_c$ is the
highest) in all three heterostructures; further,
they reveal that $T_c$ forms a D-symmetric dome
maximized at small finite D (Fig. 2, D to F; for
electron-side data showing similar behavior,
see fig. S5). We also note that TTG, TQG, and
TPG all exhibit similar variation in $T_c$ when
Fig. 1. Superconductivity and correlated insulators in alternating twisted
graphene multilayers. (A) Schematics of the alternating twisted graphene
multilayers in which each successive layer is twisted by an angle ±θ relative to
the previous one in an alternating sequence. (B) Band structure of twisted
tri-, quadri-, and pentalayer graphene (from top to bottom) for angles close to
the theoretical magic angle at zero D field (left) and $D/\varepsilon_0 = 0.4$ V nm$^{-1}$ (right) for
valley K [section 4 in (7)]. (C to E) Line cuts of $R_{xx}$ versus filling factor v for a
range of temperatures (shown are traces taken first at 25 mK, then every 0.25 K
from 0.25 K to 2 K, followed by every 1 K from 3 K to 7 K), from top to bottom
measured at $D/\varepsilon_0 = 0.22$ V nm$^{-1}$ (C), −0.15 V nm$^{-1}$ (D), and 0 V nm$^{-1}$ (E),
respectively. Inset of (C) is conductance versus T for TTG showing thermal
activation behavior [$D/\varepsilon_0 = 0.26$ V nm$^{-1}$, section 3 in (7)]. Inset of (D) shows
charge-neutrality insulators in TQG at larger electric fields. (F) $R_{xx}$ versus
temperature and v for the trilayer focusing around v = +2 at $D/\varepsilon_0 = 0.26$ V nm$^{-1}$.
(G) $R_{xx}$ versus temperature and D field for the quadrilayer focusing near charge
neutrality. (H to J) $R_{xx}$ versus temperature and v for hole doping, showing
superconducting domes around v = −2 in the same systems and for the same D fields
as in (C) to (E).


viewed as a function of the potential difference
U between the top and bottom layers [fig. S5,
D and E; see also section 3 in (7) for the energy
conversion from D to U]. This layer-number
invariance is consistent with noninteracting
continuum-model calculations tracking the
evolution of the inverse of the flat-band bandwidth
with U (Fig. 2G, bottom). The dependence
of $T_c$ on D in all devices qualitatively
matches the predictions of (16) for TTG with
one marked exception: the observed vanishing
of superconductivity and the decay of $T_c$
appears to be linear in D (Fig. 2, E and F, and
fig. S5), in line with predictions for multilayer
graphene with rhombohedral stacking




Fig. 2. TTG, TQG, and TPG phase diagrams and electric field-tunable superconductivity. (A to C) R_xx versus filling factor ν and displacement field D for twisted trilayer (A), quadrilayer (B), and pentalayer (C) graphene, respectively. All data are taken at 25 mK, and the dark blue regions signal superconductivity. For electron-doped TTG and TQG, superconducting regions extend toward ν = +1 at intermediate D field. (D to F) R_xx versus temperature and D [or equivalent potential difference U between layers, see section 3 in (7)] for the filling factors indicated by arrows in (A) to (C). Critical temperature T_c is indicated by a dashed line that delineates 10% of the normal state resistance [section 2 in (7)]. T_c is maximized at finite D fields. Overall, superconductivity is suppressed more easily with D as the layer number is increased for both hole (D to F) and electron (fig. S5) doping. (G) Theoretical calculations of the inverse of the flat-band bandwidth for twisted trilayer, quadrilayer, and pentalayer graphene as a function of D/ε₀ (top) and potential difference U (bottom). For a fixed D, the bandwidth of the flat bands is larger for systems with more layers, but when expressed as a function of U, the flat-band broadening follows a similar trend across the different structures [section 4.1 in (7)].

(17) and in contrast to the exponential “tail” typically expected from the weak-coupling theory [and seen in the model of (16)].

Comparing the location of the superconducting regions with the evolution of the Hall density as a function of D and ν in TTG, TQG, and TPG provides further insight into the intricate relationship between the superconducting phase and the correlation-modified Fermi surface (Fig. 3). As in previous TBG and TTG measurements, we observe symmetry-breaking electronic transitions (a “cascade” of transitions) that are signaled by sudden drops in the Hall density magnitude (a “reset”) without a change in sign. These resets (see dashed lines in Fig. 3, A to D) indicate a rearrangement of spin and valley sub-bands and typically occur near integer fillings of the flat bands (18, 19). At low D fields, superconducting pockets onset around the |ν| = 2 resets (purple dashed line), and the filling extent of superconductivity varies depending on the presence or absence of a |ν| = 3 flavor symmetry-breaking transition (gray dashed line). For electron- and hole-doped TTG as well as for electron-doped TQG (Fig. 3, A, B, and D), a flavor symmetry-breaking transition appears around |ν| = 3 and superconductivity accordingly terminates, as previously noted in TTG (3). By contrast, when signatures of the |ν| = 3 reset are completely absent (for example in hole-doped TQG, Fig. 3C, or TPG), superconductivity extends much further. Combined, these observations suggest that superconductivity is favored when only


two out of the four flavors are predominantly populated (|ν| = 2 cascade) and suppressed beyond |ν| = 3 resets. This behavior can be understood within the simplest iteration of the cascade scenario: resets associated with |ν| = 3 produce spin- and valley-polarized bands (20-22) and naturally disfavor Cooper pairing of time-reversed partners.

At high D fields, signatures of the cascade vanish and van Hove singularities (vHs) become more prominent instead, reflecting qualitative changes in the band structure (see yellow lines in Fig. 3, A to D, and fig. S6 that track the vHs). Consistent with previous TTG measurements (3, 4), the vHs in our TTG sample (as well as in TPG, see Fig. 3, E and F) crudely bound the
superconducting regions. By contrast, the vHs in TQG cross well into the superconducting pockets; in fact, for electron doping, T_c reaches its maximum exactly at the position of the vHs (Fig. 3D, orange dot, and fig. S7, D to F). The interplay between the vHs and superconducting boundaries as revealed in Hall density measurements is complex: T_c can be both enhanced and suppressed at the vHs depending on the layer number and possibly other details such as the precise twist angle (see fig. S11 for data from the second TQG device).

Pentalayer measurements provide additional signatures that point toward a close relation between superconductivity and flavor symmetry-breaking cascades (Fig. 3, E and F). In contrast to TTG, in TPG we can access D fields that are large enough to stifle superconductivity, which occurs simultaneously with the onset of the vHs and the apparent suppression of the cascade transitions (see red and light blue lines in Fig. 3F that mark the superconducting boundaries and the cascade transitions, respectively). For example, at low D fields (|D|/ε₀ < 0.6 V nm⁻¹) around ν = +2, the Hall density resets close to zero, in line with a nearly complete flavor symmetry-breaking polarization. However, at higher D fields (|D|/ε₀ > 0.6 V nm⁻¹), the Hall density is dominated by a vHs around ν = +2 and the cascade signatures are diminished. Superconductivity accordingly also vanishes. For hole doping, the disappearance of superconductivity similarly coincides with the weakening of the cascade. This on-and-off correspondence between the two phenomena suggests either that they share a common origin, such as a large DOS, or that the cascade serves as a prerequisite for robust superconductivity in graphene moiré superlattices.

Fig. 3. Interplay between superconductivity, flavor symmetry-breaking transitions, and van Hove singularities in TTG, TQG, and TPG. (A and B) D field and ν dependence of R_xx (top) and Hall density (bottom, measured at B = 0.9 T) for TTG at 25 mK. Purple and gray dashed lines mark the filling factors at which flavor symmetry-breaking transitions associated with |ν| = 2 and |ν| = 3 happen, respectively. The yellow line in (A) delineates the evolution of the vHs. (C and D) D field and ν dependence of R_xx (top) and Hall density (bottom, measured at B = 1.5 T) for TQG at 25 mK. Superconducting T_c reaches its maximum [orange dot in (D)] exactly at the position of the vHs. When present, flavor symmetry-breaking transitions around |ν| = 3 coincide with the termination of superconductivity [(A), (B), and (D)]. By contrast, superconductivity extends much further in the absence of a |ν| = 3 reset (C). (E) D field and ν dependence of Hall density for TPG measured at T = 25 mK and B = 1.5 T. (F) Schematic of Hall density (E) and R_xx (Fig. 2C) features for the pentalayer, including the boundary of the superconducting region (red), vHs/“gap” (dark blue), cascade (light blue), and |ν_flat| = 4 Hall density reset (light purple). Sketches of the DOS around ν = +2 for different D fields are shown on the right. The middle panel illustrates the flavor symmetry polarization observed in regions that support superconductivity. Flavor symmetry is preserved at higher D fields, as shown in the top and bottom panels.

Fig. 4. Extended superconducting pockets in TPG. (A) R_xx versus ν and temperature at zero D field for twisted pentalayer graphene. (B) R_xx versus temperature and ν on the electron side at D/ε₀ = 0.17, 0.32, and 0.44 V nm⁻¹. The evolution of the superconducting domes and resistance peaks near ν = +2 and ν = +4 with D is shown. (C and D) D field and ν dependence of R_xx (C) and Hall density [(D), measured at B = 1.5 T] at 25 mK, showing the region around the electron-side superconducting pocket. The gray line in (C) marks the vHs originating from the dispersive TBG-like bands (see also fig. S10). The inset shows the evolution of dV/dI as a function of I and B at ν = +4.6, D/ε₀ = 0.12 V nm⁻¹ (marked by a yellow dot in the main panel), confirming the robustness of the superconductivity above ν = +4. (E) Line cuts of R_xx (top, measured at T = 25 mK) and Hall density (bottom, measured at T = 1.5 K, B = 0.5 T) versus ν for a range of D fields (traces are shown for every 0.05 for both R_xx and Hall density). Both the presence of Hall density resets around ν = +4 and the development of superconductivity extending from ν = +2 to +5 are shown to persist for a wide range of D fields. (F) Schematic of scenario (ii) with a Hartree correction for superconductivity at ν = +5. The Hartree correction shifts the dispersive TBG- and MLG-like bands down in energy, which causes the flat TBG-like bands to fill more slowly with doping, thus allowing them to host superconductivity at ν > +4.

As mentioned above, for low D fields in TPG, the superconducting pockets are extraordinarily large, spanning −4 < ν < −2 for hole doping and +2 < ν < +5 for electron doping (Figs. 1E, 2C, and 4). In particular, the electron-side range corresponds roughly to a density window of 6 × 10¹² cm⁻², which is the largest filling range so far reported in a graphene-based superconductor. The observed superconductivity exhibits values of T_c and B_c (fig. S3) similar to those of the tri- and quadrilayer samples and is likewise accompanied by a weak oscillating pattern of critical current (Fig. 4C, inset), confirming superconducting phase coherence. We emphasize that the unprecedented persistence of superconductivity across a large filling factor range in TPG (and also TQG in comparison to TTG or TBG) cannot be explained in a minimal framework of alternating twisted graphene multilayers (6, 23) without invoking the nontrivial role of the additional bands. The role of the additional bands in TPG deserves careful consideration because of the implications for the strength of interactions (such as Hartree effects) and the types of superconductivity the bands can plausibly support. Explanations for the enlarged superconducting intervals can generically be organized into three scenarios depending on the filling of the flat TBG-like bands, ν_flat, relative to the total filling ν_max at which superconductivity terminates (ν_max = +5 for electron-doped TPG and |ν_max| = 4 for TQG and hole-doped TPG). In scenario (i), ν_max corresponds to ν_flat ≈ +3, the flat band filling at which superconductivity is typically suppressed in TBG, suggesting that the superconducting phase space is largely the same for different multilayer magic-angle structures when considering just the flattest TBG-like bands. In scenario (ii), ν_max coincides with ν_flat = +4, precluding any simple analogy with TBG, although superconductivity can still be attributed to the flat bands. Finally, scenario (iii) assumes full filling of the flat bands before superconductivity is suppressed at ν_max. This scenario includes the possibility that the distinction between the different TBG- and MLG-like bands breaks down even at D = 0 owing to hybridization as well as potential multi-band superconductivity (24-27). In this case, superconductivity in TPG is a more general phenomenon than in TBG because it occurs in either mixed bands or new, more dispersive bands [see section 5 in (7)].
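As a rough consistency check on the density window quoted above, the following sketch converts a filling-factor span into a superlattice carrier density and an implied twist angle. It assumes the common convention that |ν| = 4 corresponds to four carriers per moiré unit cell and the standard small-angle relation n_s = 8θ²/(√3 a²) with graphene lattice constant a = 0.246 nm. Neither the relation nor the resulting angle is stated in the text, and the function names are purely illustrative.

```python
import math

A_GRAPHENE_NM = 0.246  # graphene lattice constant in nm (standard value, not from the text)


def superlattice_density_cm2(density_window_cm2, filling_span):
    """Density at |nu| = 4 implied by a density window that spans `filling_span` units of nu.

    Assumes nu counts carriers per moire unit cell, so the density is linear in nu.
    """
    return 4.0 * density_window_cm2 / filling_span


def twist_angle_deg(n_s_cm2, a_nm=A_GRAPHENE_NM):
    """Invert the small-angle relation n_s = 8 * theta^2 / (sqrt(3) * a^2)."""
    n_s_nm2 = n_s_cm2 * 1e-14  # 1 cm^-2 equals 1e-14 nm^-2
    theta_rad = math.sqrt(n_s_nm2 * math.sqrt(3.0) * a_nm ** 2 / 8.0)
    return math.degrees(theta_rad)


# Electron-side pocket: +2 < nu < +5 (span of 3) over roughly 6e12 cm^-2.
n_s = superlattice_density_cm2(6e12, 3.0)
theta = twist_angle_deg(n_s)
print(f"n_s ~ {n_s:.2e} cm^-2, implied twist angle ~ {theta:.2f} deg")
```

Because the quoted window is approximate, the implied angle should be read as an order-of-magnitude check rather than a measured device parameter.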

From the perspective of the noninteracting band structure, the three scenarios all seem implausible and therefore interactions must play a crucial role. In particular, although the presence of the dispersive bands implies that |ν| − |ν_flat| > 0, this effect is much smaller than needed for either scenario (i) or (ii). Coulomb interactions can considerably enhance |ν| − |ν_flat|, either by evening out the spatial charge distribution (28-32) or by symmetry breaking. A simple model for TPG incorporating these mechanisms [section 4.5 in (7)] suggests a minimal flat-band occupation ν_flat = +3.8 at ν = +5, diminishing the plausibility of scenario (i) for electron-doped TPG, which has ν_max ≈ +5. The relevance of this scenario is further undermined by the observation of vHs at ν = +5 (fig. S10D): under the reasonable assumption that the noninteracting band structure remains valid for the dispersive TBG-like bands (apart from a Hartree shift), scenario (i) would instead place the observed vHs near ν = +5. Taken together, these arguments effectively rule out scenario (i). Note, however, that the presented line of reasoning is not straightforward for the other superconducting pockets [see section 5 in (7)].

Both scenarios (ii) and (iii) are indicative of the nontrivial role of additional bands in stabilizing superconductivity. Assuming well-defined flat and dispersive bands, in scenario (iii) the former bands are completely filled and superconductivity is supported fully by the latter nonflat bands. This assertion is at odds with the large dispersion of the remaining TBG- and MLG-like bands. However, although the exact mechanism underlying scenario (iii) is difficult to pin down, it is not without experimental support. For instance, a natural interpretation of the Hall density minimum around ν ≈ +4 for |D| < 0.4 V nm⁻¹ is that it marks the complete filling of the flat bands, ν_flat ≈ +4 [Fig. 4E, fig. S9, and section 5 in (7)].

One possible realization of scenario (iii) consistent with the experimental observations is that the division of the electronic states into simple TBG- and MLG-like bands fails, obviating our very definition of ν_flat and potentially allowing flavor polarization and accompanying superconductivity to persist well beyond ν = +4. Although such hybridization is expected for finite D fields, mixing between flat, dispersive TBG- and MLG-like bands for |ν| < |ν_max| may occur even at D = 0 thanks to, for example, proximity to WSe₂, layer-to-layer charge inhomogeneity, or distant-layer coupling [see sections 4.4 and 4.5.3 in (7)].

Our measurements demonstrate the increasing predominance of superconductivity in twisted graphene multilayer structures as the number of layers is increased from three to five and highlight the close relationship between the flavor symmetry-breaking transitions and superconductivity. Moreover, our findings suggest a scenario in which the symmetry-broken ν = +2 state strongly favors the formation of the superconducting state whereas the cascade corresponding to ν = +3 suppresses it. Notably, this scenario is consistent not only with previous TBG (2, 9, 12, 13, 33, 34) and TTG (3, 4, 8) observations but also in part with the recently investigated ABC trilayers (35) and Bernal bilayers with and without (36) a WSe₂ substrate (37) in which superconductivity is observed near or within phases with two out of four flavors being predominantly filled. These commonalities suggest that symmetry-broken states with similar types of polarization underlie superconductivity in all of these graphene-based superconductors. In this context, the discovery of superconductivity in TQG and TPG together with recent work on untwisted bi- and trilayers considerably expands the scope of graphene-based superconductors. This expansion holds promise for resolving ongoing questions related to the nature of the pairing mechanism in these systems and provides guidance for developing novel graphene-based superconductors and their applications.

Note added in proof: After submission of this report, we became aware of related work (38).

OCEAN ACIDIFICATION

Climate change drives rapid decadal acidification in the Arctic Ocean from 1994 to 2020


Di Qi, Zhangxian Ouyang, Liqi Chen*, Yingxu Wu, Ruibo Lei, Baoshan Chen, Richard A. Feely, Leif G. Anderson, Wenli Zhong, Hongmei Lin, Alexander Polukhin, Yixing Zhang, Yongli Zhang, Haibo Bi, Xinyu Lin, Yiming Luo, Yanpei Zhuang, Jianfeng He, Jianfang Chen, Wei-Jun Cai*


The Arctic Ocean has experienced rapid warming and sea ice loss in recent decades, becoming the first open-ocean basin to experience widespread aragonite undersaturation [saturation state of aragonite (Ω_arag) < 1]. However, its trend toward long-term ocean acidification and the underlying mechanisms remain undocumented. Here, we report rapid acidification there, with rates three to four times higher than in other ocean basins, and attribute it to changing sea ice coverage on a decadal time scale. Sea ice melt exposes seawater to the atmosphere and promotes rapid uptake of atmospheric carbon dioxide, lowering its alkalinity and buffer capacity and thus leading to sharp declines in pH and Ω_arag. We predict a further decrease in pH, particularly at higher latitudes where sea ice retreat is active, whereas Arctic warming may counteract decreases in Ω_arag in the future.


In the global ocean, an increase in anthropogenic carbon dioxide (CO₂) has led to decreases in seawater pH based on the total hydrogen ion concentration scale (pH_T) and the saturation state of the calcium carbonate mineral aragonite (Ω_arag) in a process known as ocean acidification (1). Although substantial regional and decadal variability in ocean acidification, driven by climate-induced atmospheric and oceanic circulation changes, has been observed in low- and mid-latitude ocean basins (2-4) and in the Southern Ocean (3), long-term ocean acidification rates generally have followed the trends predicted from the increase in the concentration of CO₂ in the atmosphere (2, 3). For the Arctic Ocean, although numerical models have predicted high acidification rates (5-8), observation-based decadal rates of surface water pH_T and Ω_arag change are sparse. Over the past three decades, climate warming has induced notable changes in the Arctic atmosphere-ice-ocean system (fig. S1). These changes include (i) a rapid sea ice retreat from a nearly fully ice-covered state before the 1990s to an ice-free state in the southern part of the Canada Basin and a partially ice-covered state in the northern part of the basin (9), (ii) a change from cyclonic circulation in the 1990s to an anomalous anticyclonic circulation pattern (10), and (iii) a spin-up of the Beaufort Gyre (11) over the past 20 years (fig. S1). There are also increases in freshwater storage (11), Pacific Summer Water inflow (12), biological production (13), stratification (14), air-sea CO₂ exchanges and carbon sinks (15, 16), and nutricline depth (17) and decreases in surface nutrients (18) and subsurface anthropogenic CO₂ storage (19). In addition, pH_T and calcium carbonate saturation conditions have been substantially altered by these changes (20-26). For example, the Canada Basin was the first open-ocean basin where surface aragonite undersaturation (i.e., Ω_arag < 1) was detected (20, 22) together with subsurface (21, 27) and intermediate water (28, 29) acidification. Thus, the Arctic Ocean is considered to be a bellwether of global climate change and ocean acidification (30).

Here, by using pH_T and Ω_arag estimates derived from data collected on 47 Arctic research cruises from 1994 to 2020 (see materials and methods), we document the high rates of long-term ocean acidification trends and basin-scale spatial expansion in terms of both pH_T and Ω_arag declines in the western Arctic Ocean. With this dataset, we further examine how sea ice loss and increasing atmospheric CO₂ have altered the sea-surface carbonate chemistry over the past two decades, and we propose an “ice melt-driven enhanced anthropogenic CO₂ acidification” mechanism to explain such rapid rates of regional ocean acidification.

The pH_T and Ω_arag values were calculated based on underway measurements of the partial pressure of CO₂ (P_CO2) and salinity-derived total alkalinity (TA), together with sea surface temperature and salinity, from 1994 to 2020 (table S1). Data were supplemented with pH_T and Ω_arag, which were calculated from discrete measurements of TA and dissolved inorganic carbon (DIC) (see materials and methods). The salinity-derived TA and corresponding calculated DIC values match well with those of the discrete samples, yielding uncertainties of less than ±14 μmol kg⁻¹ (fig. S2), which are further quality controlled as described in the materials and methods. The combined uncertainties of estimated pH_T and Ω_arag were 0.0113 and 0.0109, respectively, which were computed using the uncertainty propagation routine for CO2SYS (31).

Our results show that the area of sea surface with relatively low pH_T (<8.05) and Ω_arag (<1) expanded substantially (~0 to 0.88 × 10⁶ km²) from 1994 to 2020, not only increasing in areal coverage but also extending into higher latitudes in the western Arctic Ocean (Fig. 1). During the 1990s, more than 90% of the western Arctic Ocean basin waters were covered by sea ice, with a high pH_T (8.17 ± 0.05), and were supersaturated with respect to aragonite (Ω_arag = 1.46 ± 0.23) (Fig. 1, A and E). These initial conditions of the relatively higher pH_T and Ω_arag in the sea ice-covered period are due to the absence of gas exchange to replenish CO₂ from the atmosphere and low water temperature (i.e., for the same TA and DIC values, pH_T is higher in cold than warm waters). As sea ice in the western Arctic Ocean basin began to melt beyond the annual seasonal cycle of advance and retreat in the early 2000s, relatively lower pH_T (8.12 ± 0.04) and Ω_arag (1.21 ± 0.14) were observed in surface waters. This acidification happened first in the southern Canada Basin and slope and then extended northward to 76°N (Fig. 1, B and F). By the late 2000s, extreme summertime sea ice retreat mediated by both climate change and natural variability led to a broad ice-free area (~1.7 × 10⁶ km²) with a large-scale lower pH_T (8.04 ± 0.02) and aragonite undersaturation (Ω_arag = 0.97 ± 0.05) that extended into the central Canada Basin (Fig. 1, C and G). Whereas the pH_T decrease could be attributed to a combination of rapid warming and CO₂ uptake from the atmosphere, the Ω_arag decrease was mainly due to CO₂ uptake because warming would result in an opposite trend. From 2011 to 2020, although the overall sea ice concentration and pH_T and Ω_arag values were similar to those of the late 2000s, the areas of low-pH_T (8.02 ± 0.01) and Ω_arag-undersaturated (0.95 ± 0.05) waters continued to expand northwestward (Fig. 1, D and H). We attribute this acidification to the continuously increasing atmospheric CO₂ uptake over the expanded ice-free regions. Overall, from the 1990s to the 2010s, rapid sea ice loss and acidification co-occurred in the western Arctic Ocean, but thereafter, relatively slow acidification occurred, accompanied by interannual fluctuation in sea ice coverage.

Fig. 1. Observations of ocean acidification and its decadal summertime trends in the western Arctic Ocean. (A to H) pH_T [(A) to (D)] and Ω_arag [(E) to (H)] in the western Arctic Ocean during 1994 to 1999, 2000 to 2006, 2007 to 2010, and 2011 to 2020. The black dashed lines show the ice edge (with an ice concentration threshold of 15% and gray shading representing sea ice concentration) during the first week of September during the first year of the subperiod; the red dashed lines show the location of that edge during the final year of the subperiod. Cruise information is provided in table S1. Sea ice edge data are from the National Snow and Ice Data Center (https://nsidc.org/data/seaice_index/). (I and J) Decadal summertime trends of pH_T and Ω_arag. Uncertainties associated with the trends were assessed by the Monte Carlo approach (see methods for detailed calculations). The error bars represent the combined uncertainty from sample measurements, carbonate system calculations, and the gridding-induced spatiotemporal variability. (K and L) Comparison of rates and mechanisms of acidification in the Arctic and other oceans. The gray vertical bars show annual rates, and the circles indicate the relative contributions of component mechanisms and their overall acidification effect. A red-plus circle indicates a positive contribution to acidification (or a positive net effect), and a blue-minus circle indicates an effect that counters acidification; the size of each circle indicates relative magnitude. The Arctic Ocean acidification rate was calculated as the average of the rates obtained in this study for the four western Arctic Ocean subregions (SCB, NECB, NWCB, and IC); decadal trend values for the other ocean basins were obtained from previous studies listed in table S2. Error bars represent the standard deviation of the mean rates. OA, ocean acidification.

Fig. 2. Linking Arctic Ocean basin acidification to sea ice loss. (A) Sea ice loss, acidification status, and CO₂ buffer capacity (Revelle factor) in the rapid sea ice melt regions of the Canada Basin (including SCB, NECB, and NWCB). The Revelle factor (the ratio of ∂P_CO2/P_CO2 to ∂DIC/DIC) is a general measure of the ocean's sensitivity to an increase in CO₂. The sea surface temperature (SST) data were obtained from the Advanced Very High Resolution Radiometer (AVHRR) (39), whereas the sea ice extent data were obtained from the National Snow and Ice Data Center. Note that we use September monthly means to represent the decadal trends of sea ice extent. The middle and bottom panels show the mean decadal trends in the rapid ice melt area of the Canada Basin (south of 80°N), including the SCB, NECB, and NWCB regions. (B) Long-term change in fractions of meltwater and river water in the surface water (<20 m) of the Canada Basin. The calculation methods follow those in (20, 21). (C) Relationships between sea ice extent and pH_T and Ω_arag. The sea ice extent data are from https://nsidc.org/data/G02135/versions/3.

To quantify the long-term trends in pH_T and Ω_arag, we first divided the western Arctic Ocean into five subregions based on spatial (Fig. 1A). We then averaged pH_T and Ω_arag within each grid cell (0.1° latitude by 0.25° longitude) to obtain daily and monthly means for each cruise dataset. Finally, we averaged the gridded results within each subregion to obtain monthly and yearly (from June to October) means (see materials and methods). Different grid sizes, averaging schemes, deseasonalization treatments, and sensitivity tests (randomly removing 15 or 30% of measurements or cruises) were tested (figs. S3 to S11) for the determination of long-term trends. These various approaches yielded comparable and consistent rates of sea surface pH_T and Ω_arag decrease in all subregions except for the NWCB and CS, where undersampling in the former and the large natural spatiotemporal variability in the latter prohibit a good performance of deseasonalization.
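The gridding and random-removal sensitivity tests described above can be sketched as follows. This is a simplified illustration and not the authors' processing code: it bins observations into 0.1° latitude by 0.25° longitude cells, averages the cell means, fits an ordinary least-squares trend, and repeats the fit after randomly dropping a fraction of the points. The function names, the synthetic data, and the use of ordinary least squares are assumptions; the actual procedure (per-cruise daily and monthly means, deseasonalization) is described in the materials and methods.

```python
import math
import random
from collections import defaultdict


def grid_cell(lat, lon, dlat=0.1, dlon=0.25):
    """Index of the dlat x dlon degree cell that contains a point."""
    return (math.floor(lat / dlat), math.floor(lon / dlon))


def gridded_mean(observations, dlat=0.1, dlon=0.25):
    """Average values inside each cell first, then average the cell means.

    `observations` is an iterable of (lat, lon, value) tuples.
    """
    cells = defaultdict(list)
    for lat, lon, value in observations:
        cells[grid_cell(lat, lon, dlat, dlon)].append(value)
    cell_means = [sum(values) / len(values) for values in cells.values()]
    return sum(cell_means) / len(cell_means)


def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("need at least two distinct x values")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def subsampled_slopes(xs, ys, drop_fraction, n_trials=200, seed=0):
    """Refit the trend after randomly removing `drop_fraction` of the points."""
    rng = random.Random(seed)
    n_keep = max(2, round(len(xs) * (1.0 - drop_fraction)))
    slopes = []
    for _ in range(n_trials):
        keep = rng.sample(range(len(xs)), n_keep)
        slopes.append(ols_slope([xs[i] for i in keep], [ys[i] for i in keep]))
    return slopes


# Synthetic yearly means with noise, for illustration only.
noise = random.Random(42)
years = list(range(1994, 2021))
ph_t = [8.17 - 0.0069 * (y - 1994) + noise.gauss(0.0, 0.01) for y in years]
for fraction in (0.15, 0.30):
    trial = subsampled_slopes(years, ph_t, fraction)
    print(fraction, min(trial), max(trial))
```

Averaging cell means rather than raw points keeps densely sampled cruise tracks from dominating a regional mean, and the spread of the refitted slopes gives a rough sense of how sensitive a trend is to individual measurements.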
Fig. 3. Evolution of pH_T and Ω_arag as a function of days since ice retreat from 1994 to 2020 on the Arctic Ocean basin. (A to D) Sea ice concentration (SIC) (A), surface water pH_T (B), surface water Ω_arag (C), and surface water P_CO2 (D). The dots and error bars represent the means and standard deviations of SIC, pH_T, Ω_arag, and P_CO2 within 1-day intervals. The “clouds” of gray dots in the background show individual data points. The solid lines show the 1D model-simulated days since ice retreat (DSR) dependences of these two CO₂ system parameters for four periods using a constant heating rate and panel until the DSR is 60, at which point the temperature is then held constant. The blue and red dashed lines represent the atmospheric CO₂ level in the 2007 to 2010 and 2011 to 2020 periods, respectively. See materials and methods for an explanation. The vertical dashed gray lines denote DSR of −40, 0, and 60 days.

Our results revealed rapid acidification in the western Arctic Ocean basins, with a mean annual rate of −0.0069 ± 0.0011 for pH_T and −0.0216 ± 0.0040 for Ω_arag from 1994 to 2020 (table S2). These rates are approximately four and three times faster, respectively, than the long-term decline rates in other ocean basins [Fig. 1, K and L, table S2, and (32)]. Similarly, the increase rate (0.128 ± 0.019 nmol kg⁻¹ year⁻¹) of hydrogen ion [H⁺] is also about four times faster than that in other ocean basins (table S2). The mean annual acidification rate was highest in the NECB (−0.0086 ± 0.0013 for pH_T and −0.020 ± 0.0028 for Ω_arag) and slowest in the ice-covered higher-latitude ocean basin (Fig. 1, I and J). These observational rates in the western Arctic basins are much faster than those projected by both regional and global models (e.g., −0.0025 to −0.0030 year⁻¹ for pH) (5-8). By contrast, we found a relatively slower rate of acidification on the CS than in the basins, with a mean annual rate of change of −0.0031 ± 0.0024 for pH_T and −0.0009 ± 0.0138 for Ω_arag (fig. S12). The long-term acidification rates on the CS are likely mitigated by a strong and increasing biological CO₂ removal driven by nutrient inputs from the Pacific Ocean, which uses and counteracts CO₂ influx from the atmosphere (13, 16, 33).
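The reported pH_T trend and hydrogen ion trend can be related through the definition of pH. The sketch below is an illustrative check rather than the authors' calculation: it assumes pH_T = −log10([H⁺]) with [H⁺] in mol kg⁻¹, so that d[H⁺]/dt = −ln(10) · [H⁺] · dpH/dt, and it evaluates [H⁺] at a representative pH_T of 8.10. That value is an assumption chosen between the 1990s (8.17) and 2010s (8.02) means quoted earlier.

```python
import math


def hydrogen_ion_nmol_per_kg(ph_total):
    """[H+] in nmol per kg, from pH = -log10([H+] in mol per kg)."""
    return 10.0 ** (-ph_total) * 1e9


def hydrogen_ion_rate(ph_total, ph_rate_per_year):
    """d[H+]/dt = -ln(10) * [H+] * dpH/dt, in nmol per kg per year."""
    return -math.log(10.0) * hydrogen_ion_nmol_per_kg(ph_total) * ph_rate_per_year


# Basin-mean pH_T trend from the text; the representative pH of 8.10 is an assumption.
rate = hydrogen_ion_rate(8.10, -0.0069)
print(f"d[H+]/dt ~ {rate:.3f} nmol/kg/yr")
```

Under these assumptions the estimate falls within the stated uncertainty of the reported 0.128 ± 0.019 nmol kg⁻¹ year⁻¹, which suggests the two trend values are mutually consistent.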

When we focus on the regions of massive sea ice loss in the Canada Basin (including SCB, NWCB, and NECB), a notable finding appears: the decreases in pH_T and Ω_arag are strongly correlated with the decrease in sea ice extent over the past 26 years (Fig. 2). Notably, the earlier 18-year period (1994 to 2012) has the steepest decline trends in pH_T (−0.0098 ± 0.0019 year⁻¹) and Ω_arag (−0.0342 ± 0.0077 year⁻¹), which corresponds well with the steepest decrease in sea ice (Fig. 2). However, the decreasing trends of both pH_T and Ω_arag were slower during 2010 to 2020, which is consistent with the alleviated reduction of sea ice since 2008, though a strong interannual variability is apparent (Fig. 2A). Nevertheless, even during this period, the rate of pH_T decrease is still comparable to rates of other ocean basins (Fig. 2A and table S2), which are mainly driven by the increasing atmospheric CO₂. Unlike pH_T, no statistically significant trend for Ω_arag was found during 2010 to 2020, reflecting the counteracting effect of warming. Clearly, massive sea ice retreat, modulated by the large-scale climate change pattern, has played a critical role in the fast acidification in the Arctic Ocean over the past two to three decades.

Fig. 4. Sea ice loss and melt are amplifying surface ocean acidification in the Arctic Ocean basin. (A to C) The decadal progression of ocean acidification with respect to pH_T (A), Ω_arag (B), and the Revelle factor (C). The black dots represent the observational conditions for pH_T, Ω_arag, and the Revelle factor (values calculated from decadal means; table S3). The gray dot represents the initial condition after sea ice melt [a two-endmember mixing model consisting of 1990s ice-covered seawater and ice-melt water was applied to calculate diluted TA and DIC and carbonate parameters; TA was used as a tracer, and the meltwater endmember was obtained from (40)]. The arrows indicate the processes of warming (orange), dilution (I, dark cyan), dilution-induced CO₂ uptake (II, magenta), and ice melt-induced CO₂ uptake (III and IV, magenta). The dilution-induced process (II) refers to the uptake of CO₂ to compensate for the dilution-lowered P_CO2 (increased up to the initial P_CO2 in the 1990s; fig. S15); the ice melt-induced process refers to the following CO₂ uptake from the atmosphere, where initial CO₂ deficit term (III) denotes the disequilibrium between 1990s P_CO2 and air P_CO2, and atmospheric (Atm) CO₂ increase term (IV) denotes acidification driven by the air P_CO2 increase from the 1990s to 2010s. The blue-shaded areas indicate the possible decadal variations in pH_T, Ω_arag, and the Revelle factor, which were synergistically affected by warming and CO₂ uptake. The initial status of the ice-covered Canada Basin in the 1990s is set as salinity (S) = 31.3, temperature (T) = −1.6°C, TA = 2192 μmol kg⁻¹, DIC = 2059 μmol kg⁻¹, and P_CO2 = 278 μatm (table S3). Dilution with a meltwater endmember of TA = 70 μmol kg⁻¹ (40) results in a lower TA of 1912 μmol kg⁻¹, DIC of 1792 μmol kg⁻¹, and P_CO2 of 214 μatm (and simultaneously lower Ω_arag owing to decreased salinity and carbonate ion concentration, but higher pH_T owing to decreased hydrogen ion concentration) (dilution effect I). When P_CO2 increases back to the initial P_CO2 level before ice melting in the 1990s (dilution-induced CO₂ uptake, II), it causes a decrease in pH of 0.10 and a decrease in Ω_arag of 0.25. These decreases will be driven by subsequent ice melt-induced CO₂ uptake (III and IV, where IV represents the air P_CO2 decadal increase of 38 μatm and corresponding changes in pH and Ω_arag), which leads to a further decrease in pH of 0.05 and a decrease in Ω_arag of 0.11. (D to F) Surface-water acidification parameters as a function of surface P_CO2 for the global tropical ocean, the Southern Ocean, and the Arctic Ocean (ice-covered and ice-free scenarios) from the multidecadal perspective. The dashed lines are linear extensions of the trendlines, which may not be realistic. Calculations were made using CO2SYS with the following input values: T = 26°C, S = 35, and TA = 2332 μmol kg⁻¹ for the tropical ocean; T = −0.3°C, S = 34, and TA = 2297 μmol kg⁻¹ for the Southern Ocean; T = −1.6°C, S = 31, and TA = 2192 μmol kg⁻¹ for the Arctic Ocean, ice-covered scenario; and T = 0.72°C, S = 26, and TA = 1912 μmol kg⁻¹ for the Arctic Ocean, ice-free scenario (table S3). The red arrows refer to the processes that regulate decadal changes in pH_T, Ω_arag, and the Revelle factor in (A) to (C).

To explain the link between rapid long-term pHT and Ωarag decline rates and the massive
sea ice loss in the Arctic Ocean, a thorough understanding of the evolution of carbonate
chemistry in response to the sea ice condition is required. In the 1990s, when most of
the Arctic Ocean surface was covered by sea ice, the transfer of CO2 from the atmosphere
into the surface waters was impeded by sea ice (fig. S1A), resulting in a large deficit of
DIC from the atmospheric equilibrium value (ΔDIC = -25 μmol kg⁻¹; figs. S13C and S14). As a
consequence, the cold surface seawater beneath the sea ice has a high potential to absorb
atmospheric CO2 once it is exposed to the atmosphere.

Since the 2000s, the Arctic Ocean has experienced accelerated warming and substantial
sea ice retreat (e.g., ice extent was at its lowest in the summer of 2012) (Fig. 2A and
fig. S1B). We used a simple one-dimensional (1D) dynamic model to simulate the response
of PCO2, pH, and Ωarag to the decrease of sea ice (Fig. 3; see supplementary materials for
a description of the model). We found that, under partially or newly ice-free conditions,
the low-PCO2 waters that were originally under the ice were exposed to higher atmospheric
PCO2 and rapidly took up CO2 through air-sea gas exchange (Fig. 3D). In other words, the
initial CO2 deficit resulted in a CO2 increase “boost” over that time period. The shallow
surface mixed layer and strong stratification (34, 35) also prevented the dilution of the
absorbed CO2, leading to rapid responses of the carbonate system parameters (Fig. 3, B
and C, and fig. S1B).

The rapid decreases in pHT and Ωarag along with sea ice retreat are seen not only over a
seasonal time scale but also over a decadal time scale (Fig. 3). We further quantified and
decomposed the decadal drivers of acidification (table S7) and translated the results to
a graphic illustration (Fig. 4, A to C). We found that the net increase in DIC due to CO2
uptake plays the predominant role in regulating acidification, with minor contributions
from warming and dilution. However, we further diagnosed that, indirectly, approximately
half of the changes in PCO2, pHT, and Ωarag that resulted from CO2 uptake are attributable
to the additional CO2 deficit triggered by sea ice melt (i.e., dilution of TA and DIC leads
to lower initial PCO2; see Fig. 4, A to C, fig. S15, and table S7). Thus, the indirect effect
of dilution on the seawater carbonate system through the promotion of CO2 uptake
is an important factor driving acidification and weakening buffer capacity in the western
Arctic Ocean (Fig. 4, A to C).
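
The dilution step can be made concrete with the endmember values quoted in the Fig. 4 caption (TA of 2192 μmol kg⁻¹ under ice, 1912 μmol kg⁻¹ after dilution, and a meltwater endmember of 70 μmol kg⁻¹). The sketch below assumes conservative two-endmember mixing of TA; that is a simplification for illustration, not the calculation reported in the supplementary materials, and the function names are hypothetical.

```python
def meltwater_fraction(ta_initial, ta_diluted, ta_meltwater):
    """Meltwater fraction f such that
    ta_diluted = (1 - f) * ta_initial + f * ta_meltwater."""
    return (ta_initial - ta_diluted) / (ta_initial - ta_meltwater)


def mix(value_seawater, value_meltwater, fraction):
    """Conservative two-endmember mixture for a given meltwater fraction."""
    return (1.0 - fraction) * value_seawater + fraction * value_meltwater


# Total alkalinity in umol/kg, taken from the Fig. 4 caption.
TA_INITIAL = 2192.0    # ice-covered Canada Basin, 1990s
TA_DILUTED = 1912.0    # after meltwater dilution
TA_MELTWATER = 70.0    # meltwater endmember

fraction = meltwater_fraction(TA_INITIAL, TA_DILUTED, TA_MELTWATER)
print(f"Implied meltwater fraction: {fraction:.3f}")
print(f"TA recovered by mixing: {mix(TA_INITIAL, TA_MELTWATER, fraction):.1f} umol/kg")
```

Under this mixing assumption, the quoted TA values imply a meltwater fraction of roughly 13%. The resulting PCO2, pHT, and Ωarag still require a full carbonate-system solver such as CO2SYS, which this sketch does not attempt.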

The progression in sea ice loss represents a major change in seawater carbonate chemistry,
from a state of well-buffered seawater with relatively high alkalinity and low PCO2
underneath the sea ice (fig. S1A) to a meltwater-diluted state characterized by lower
alkalinity, higher PCO2, and lower buffer capacity under the partially or fully ice-free
surface (Fig. 4, D to F, and fig. S1C). As a result, atmospheric CO2 invasion would lead to
a greater overall decrease in pHT, Ωarag, and buffer capacity in seawater diluted by
meltwater than in the original water (Fig. 4, A to C, and fig. S15), illustrated also as a
jump from one pHT-PCO2 evolving locus along the high iso-alkalinity line in the 1990s to
the low iso-alkalinity line in the 2010s (Fig. 4, D to F). We name this distinctive
mechanism “ice melt-driven enhanced anthropogenic CO2 acidification.” The mechanism
explains the amplified rapid acidification observed in the Arctic Ocean over the past two
to three decades, in contrast to the slower acidification in other global oceans along
only one alkalinity line (Fig. 4, D to F). This mechanism could conceivably also operate
in the sea ice zone within the Southern Ocean, where seasonal sea ice melt also plays
an important role in sea surface CO2 dynamics and in air-sea CO2 gas exchange (36).

We must also point out that further warming could result in contrasting impacts on pHT
and Ωarag trends (fig. S16; see supplementary text for details). This phenomenon has
emerged in late summer in ice-free southern regions since the 2010s, where both warming
and CO2 uptake from the atmosphere decreased pHT (fig. S16A), whereas warming (when it
exceeds +1°C) increased and CO2 uptake decreased Ωarag (fig. S16B). When temperature
further increases, the net Ωarag trend could even be positive, similar to what has
been observed in some other parts of the world’s oceans (37, 38) (this is also illustrated
as the cancellation effect of the red-plus circle by the blue-minus circle in Fig. 1L).

Although Arctic sea ice coverage has fluctuated over the past decade, it is also shifting
from multiyear ice to first-year ice (fig. S17). This thinning trend of sea ice implies
that summer sea ice will continue to decline in extent and lower the seawater alkalinity
and buffer capacity until the Arctic Ocean is turned into an ice-free region. Finally,
although the overall freshwater increase, including that from rivers, has decreased the
anthropogenic CO2 inventory in the subsurface water (19), there is, at present, no
significant decadal trend in river-water content in the surface layer (Fig. 2B).
Therefore, both larger winter-to-summer meltwater-induced seawater alkalinity dilution
and greater subsequent CO2 uptake from the atmosphere are expected to be the dominant
processes in surface waters. As a consequence, the “ice melt-driven enhanced
anthropogenic CO2 acidification” mechanism may continue to operate over the next few
decades, becoming even more pronounced until the time that sea ice completely disappears
during summer (at which time, Arctic Ocean surface seawater may reach its lowest
alkalinity) (Fig. 4). Therefore, this greatly amplified summertime ocean acidification
modulated by large-scale climate change may lead to long-lasting impacts on the
biogeochemistry, ecosystem, and organisms in the
Arctic Ocean basins.

SEA LEVEL CHANGE

A detection of the sea level fingerprint of Greenland Ice Sheet melt

Sophie Coulson, Sönke Dangendorf, Jerry X. Mitrovica, Mark E. Tamisiea, Linda Pan, David T. Sandwell


Rapid melting of ice sheets and glaciers drives a unique geometry, or fingerprint, of sea level change.
However, the detection of individual fingerprints has been challenging because of sparse observations
at high latitudes and the difficulty of disentangling ocean dynamic variability from the signal. We predict
the fingerprint of Greenland Ice Sheet (GrIS) melt using recent ice mass loss estimates from radar
altimetry data and model reconstructions of nearby glaciers and compare this prediction to an
independent, altimetry-derived sea surface height trend corrected for ocean dynamic variability in the
region adjacent to the ice sheet. A statistically significant correlation between the two fields (P < 0.001)
provides an unambiguous observational detection of the near-field sea level fingerprint of recent GrIS
melting in our warming world.


As an ice sheet loses mass, its gravitational attraction on the nearby ocean is
reduced, causing ocean water to migrate away from the ice sheet. Additionally,
the solid Earth rebounds in response to the reduction in surface loading. This
combination of geophysical processes leads to a relative sea level fall within
~2000 km of the melting ice sheet and a progressive sea level rise outside of this
region (1-11). This nonuniform geometry of sea level change, commonly called a sea
level fingerprint (6, 9) but more recently referred to as a barystatic-GRD (gravity,
rotation, and deformation) fingerprint (12), suggests the possibility of using tide
gauge and satellite measurements of sea level change to both improve estimates of
global mean sea level (GMSL) rise (13) and constrain the individual sources of
meltwater (6, 7, 9). However, sea level changes are also affected by large dynamic
sea level signals associated with ocean and atmosphere circulation and steric
effects (14, 15), and by both natural and artificial variability in terrestrial
water storage (16-18).

The detection of individual sea level finger- 
prints will, in general, require observations 
spanning multiple decades. For example, sim- 
ulations of the sea level fingerprint and dy- 
namic sea level changes in response to melting 
of the Greenland Ice Sheet (GrIS) at a rate of 
0.5 mm/year in units of equivalent GMSL rise 
[in line with estimates of Greenland ice mass 
flux over the past 20 years (19)] suggest that 
the rigorous detection of the sea level fingerprint
would require ~30 years of altimeter measurements
if these were limited to latitudes
lower than 66°N (20). The time scale of de- 
tection would, however, be reduced consider- 
ably if tide gauge or altimeter measurements 
were available at higher latitudes in proximity 
to GrIS, where the highest gradients in the 
predicted structure of the sea level fingerprint 
occur (20), or if ocean dynamic variability un- 
related to the ice melt can be isolated from sea 
level records. 

Detection efforts to date have largely fo- 
cused on sea level records outside the zone 
of major sea level fall, where the gradients 
and amplitudes of the fingerprint signal are 
considerably lower (21-23). Notably, Hsu and
Velicogna (22) compared Gravity Recovery and 
Climate Experiment (GRACE)-based global 
sea level fingerprint predictions with ocean 
bottom pressure data from sensors in the 
tropics and determined that the fingerprints 
agreed with the annual phase of data and 
differed from the barystatic estimate at two 
of eight stations. Davis and Vinogradova (21)
demonstrated that the fingerprint of GrIS ice 
mass loss had measurable influence on tide 
gauge records of sea level acceleration along 
the eastern US coast since the mid-1990s. 
Moreira et al. (23) compared modeled sea level 
fingerprints with steric-corrected altimetry- 
based sea level data in the Atlantic Ocean 
surrounding south-eastern Greenland (among 
other regions) over a 10-year period. Despite 
low signal-to-noise ratios, they found that 25% 
of the variance in the altimetry-derived data- 
set is explained by a sea level fingerprint. How- 
ever, observational datasets used in this study 
only extend to 65°N, and thus the majority of 
the ocean around Greenland was not captured. 
In this work, we built on these previous results 
to provide a statistically rigorous, unambiguous 
detection of the fingerprint signal from GrIS ice 
mass loss in sea surface height (SSH) trends 
encircling the ice sheet.

Fig. 1. Observed trends in SSH anomalies in the vicinity of Greenland.
(A) Altimetry-derived mean SSH trends in millimeters per year over the
period 1993-2019 (26-28). (B) As in (A), but corrected for ocean dynamic
variability (Fig. 2). (C) The associated error estimate (based on an


We analyzed updated multimission altime- 
try records and corrected for the ocean dy- 
namic variability signal to compute residual 
observational trends in sea surface height 
anomaly extending up to 80°N. We then com- 
pared these SSH trends with predicted sea 
level fingerprints using classical sea level mod- 
eling over a 27-year period and an 18-year 
period. The two time periods were considered 
separately to demonstrate robustness; the lon- 
ger time range, from 1993 to 2019, includes the 
full observational period, taking advantage 
of all multimission altimetry data available, 
whereas the shorter time range, from 2002 to 
2019, highlights a period of increased melting 
of the ice sheet (19, 24-26). 

Satellite altimetry has provided measure- 
ments of SSH anomalies since the early 1990s. 
The Copernicus Marine Environment and Mon- 
itoring Service (CMEMS) (27) reprocesses 
multimission altimetry records, and the most 
recent analysis is based on the Data Unifica- 
tion Altimeter Combination System (DUACS) 
DT2018 [see figure 1 of (26)]. After filling data 
gaps at higher latitudes (see supplementary 
materials), we used these observational records 
to compute mean trends over the periods 
1993-2019 and 2002-2019 (Fig. 1, A and D).
Notably, in the North Atlantic, the observations
from the DUACS altimeter processing
extend up to 80°N (Fig. 1), although their temporal
availability decreases northward (fig. S1).
The effective spatial resolution of the data is 
estimated to vary from ~100 km at the south- 
ern tip of Greenland, decreasing to ~250 km 
at 72°N in Baffin Bay on the west side of 
GrIS, and to ~150 km at 66°N on the east side 
between Greenland and Iceland (28). No esti- 
mate of resolution is provided for sites above 
72°N in (28). 

Over the 1993-2019 period, SSH anomaly 
rates off the west and south coast of Greenland 
trend downward toward the coastline in a 
concentric structure indicative of a Greenland 
ice melt fingerprint (Fig. 1A). Rates over the 
shorter 2002-2019 period (Fig. 1D) retain some 
of this spatial structure, but the SSH anomaly 
becomes generally noisier. The change in rates 
is particularly pronounced in the subpolar 
gyre region southeast of Greenland, where 
rates reverse sign from positive since 1993 
to negative since 2002. These decadal rate 
changes in the subpolar gyre region have
previously been linked to (largely wind-driven)
changes in large-scale ocean circulation
(29, 30). Indeed, the subpolar gyre
region has been transitioning from a period of 
strong warming between the early 1990s and 
mid-2000s to a cooling thereafter (29), and the 
signal has also been propagating up into the 


autoregressive process of the order 1) in this corrected signal. (D to F) As in 
(A) to (C), except for the shorter time period 2002-2019. The red dotted 
box and black dashed line in (E) denote a region and profile considered 

in the comparative analysis of Fig. 4. 


coastal zones of the northeastern Labrador 
Sea (Fig. 2). 

One approach to isolating the nonuniform 
geometry of sea level fingerprints is by esti- 
mating depth-integrated steric height changes 
from temperature and salinity fields and re- 
moving them from altimetric SSH (23). Al- 
though this may work well in the deep ocean 
away from a melt source, SSH changes in the 
vicinity of an ice sheet can involve both deep 
and shallow ocean (fig. S1B). The latter is 
dominated by dynamic ocean mass redistrib- 
ution, which is reflected in ocean bottom 
pressure changes rather than the steric height 
(fig. S2) (4). In the North Atlantic, this dy- 
namic ocean mass variability has been shown 
to be tightly linked to variations in large-scale 
wind stress forcing (31). We therefore used
an alternative approach, in which we iden- 
tified common interannual variability between 
SSH and wind stress curl over the extratropical 
North Atlantic sector with a maximum co- 
variance analysis and used the leading modes 
of wind stress curl as predictors for a regres- 
sion with SSH (see supplementary mate- 
rials). Depending on location, this regression 
explains up to 90% of the local SSH variability 
around Greenland (Fig. 2B; median explained 
variability over the entire domain is 69%) 
and captures well the decadal trends in the

Fig. 2. Correction of ocean dynamic variability across the study
area. (A) SSH time series before (gray and black) and after (blue)
removing ocean dynamic variability associated with wind stress forcing
(red) averaged over the area bounded by the red box in (B) over
the period 1993-2019. Note that a vertical offset of ~0.08 m has been

Fig. 3. Predicted trends in SSH anomalies in the vicinity of Greenland.
Trend in SSH over the periods (A) 1993-2019 and (B) 2002-2019
(millimeters per year) computed using a radar altimetry-derived
record of Greenland Ice Sheet mass balance over the same period (19)


extended subpolar gyre region (Fig. 2A). Thus, 
the regression accounts for a significant frac- 
tion of both spatiotemporal variability in steric 
height as well as ocean dynamic mass redis- 
tribution. When this signal is removed from 
the altimetric SSH field, the time series be- 
come less noisy (Fig. 2A) and the spatial trend 
patterns over the two time periods become 
more similar (Fig. 1, B and E). Additionally, 
because of the removal of ocean-internal var- 
iability, the trend estimation error is signif- 
icantly reduced (the median reduction is ~45% 
over the entire region). 
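
The correction described above can be sketched in a few lines. This is a minimal illustration on synthetic data, not the authors' implementation (which is described in their supplementary materials): it assumes a maximum covariance analysis computed as a singular value decomposition of the cross-covariance matrix between wind stress curl and SSH anomalies, followed by an ordinary least-squares regression of SSH on the leading curl expansion coefficients. All function names, array shapes, and noise levels are hypothetical.

```python
import numpy as np


def leading_curl_modes(ssh, curl, n_modes=1):
    """Time series of the leading MCA modes of wind stress curl.

    ssh  : array (n_time, n_ssh_points)
    curl : array (n_time, n_curl_points)
    """
    ssh_anom = ssh - ssh.mean(axis=0)
    curl_anom = curl - curl.mean(axis=0)
    # Cross-covariance between every curl point and every SSH point.
    cross_cov = curl_anom.T @ ssh_anom / (ssh.shape[0] - 1)
    # Left singular vectors: curl patterns that covary most with SSH.
    u, _, _ = np.linalg.svd(cross_cov, full_matrices=False)
    return curl_anom @ u[:, :n_modes]


def remove_wind_driven_signal(ssh, predictors):
    """Regress SSH at each grid point on the predictors.

    Returns the corrected SSH and the fraction of variance explained.
    """
    ssh_mean = ssh.mean(axis=0)
    ssh_anom = ssh - ssh_mean
    design = np.column_stack([np.ones(len(predictors)), predictors])
    coeffs, *_ = np.linalg.lstsq(design, ssh_anom, rcond=None)
    residual = ssh_anom - design @ coeffs
    explained = 1.0 - residual.var(axis=0) / ssh_anom.var(axis=0)
    return residual + ssh_mean, explained


# Synthetic stand-in: one shared wind-driven signal plus independent noise.
rng = np.random.default_rng(0)
n_time = 324  # 27 years of monthly values (assumed sampling)
driver = rng.standard_normal(n_time)
curl = np.outer(driver, np.linspace(-1.0, 1.0, 20)) + 0.1 * rng.standard_normal((n_time, 20))
ssh = np.outer(driver, np.linspace(0.5, 1.5, 15)) + 0.3 * rng.standard_normal((n_time, 15))

modes = leading_curl_modes(ssh, curl, n_modes=1)
ssh_corrected, explained = remove_wind_driven_signal(ssh, modes)
print("median explained variance:", round(float(np.median(explained)), 2))
```

In a real application the inputs would normally be detrended or otherwise prepared so that the regression does not absorb the long-term trend being sought; the sketch leaves that step out.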

We predicted the rate of SSH change over 
the periods 1993-2019 and 2002-2019 by adopting
a reconstruction of GrIS mass balance
estimated from an analysis of multisatellite
Ku-band altimetry measurements for the main 
ice sheet (19) and a model reconstruction of 
the glaciers at the periphery of the ice sheet 
and in nearby regions (Arctic Canada and 
Iceland) forced with the ERA5 global climate 
reanalysis (32, 33). We calculated gravitation- 
ally self-consistent sea level changes using the 
theory and pseudo-spectral algorithm described 
in (34), with truncation at spherical harmonic 
degree and order 512 (Fig. 3, A and B), neg- 
lecting the effect of Earth’s rotation to be con- 
sistent with the orientation of the reference 
frame assumed in altimetry. In addition to 
the recent ice history in Greenland, the sim- 
ulation requires a model for the rheological 


applied to the blue time series for better visibility. The correlation
between the area average of SSH and ocean dynamic variability
is 0.79. (B) Variance explained by the maximum covariance analysis
regression (as an estimate of ocean dynamic variability) at each
grid point around Greenland.

and model reconstructions of Greenland peripheral and nearby glaciers
(33). (C) Trends in SSH due to ongoing glacial isostatic
adjustment associated with the last ice age, computed using the
ICE-6G_C/VM5a model of GIA (37).


structure of the solid Earth. For this purpose, 
we adopted a depth-varying Maxwell visco- 
elastic model in which the seismically inferred 
Preliminary Reference Earth Model (PREM) 
(35) provides depth-varying elastic and den- 
sity profiles through Earth’s crust and mantle. 
The model has an upper-mantle viscosity of 
10° Pa s, consistent with the inference of (36), 
which is based on the crustal response to re- 
cent ice mass flux, and a lower mantle viscosity 
of 5 x 107" Pa s. The elastic response strongly 
dominates (to >90%) the total predicted sig- 
nal. For direct comparison to the altimetry- 
derived datasets, we considered the modeled 
change in SSH rather than the full calculated 
sea level change, where the latter includes

Fig. 4. Comparison of predicted and observed SSH trends in the vicinity of
Greenland. Trends over the periods (A and B) 1993-2019 and (C and D) 2002-2019.
[(A) and (C)] Altimetry-derived mean SSH trends [(26-28); corrected for ocean
dynamic variability], corresponding to Fig. 1, B and E, respectively, along the black
dashed profile shown in Fig. 1E (blue line, with error bars taken from Fig. 1, C and F).
Trends along the same profile computed using a radar altimetry-derived record of


changes in bathymetry not captured in satellite
altimetry observations. We also performed
a reference frame shift to the center of mass of
the Earth system (i.e., a system including surface
masses). The resulting trend maps (Fig.
3, A and B) reveal the typical fingerprint 
structure, with a predicted fall in SSH across 
most of the region covered by the figure, with 
progressively larger magnitude toward the 
site of the ice mass flux. 

As we have noted, previous analyses of GrIS 
mass flux indicate that a major increase in 
melt rate of the Greenland ice sheet began in 
~2002 (19, 24-26). (The mean melt rates in- 
cluded in our calculation are 0.74 mm/year 
across the period 1993-2019 and 1.10 mm/year 
across the period 2002-2019, in units of equiv- 
alent GMSL rise.) This increase is apparent in 
the higher magnitude of the predicted signal 
in Fig. 3B relative to Fig. 3A. However, this 
difference is relatively localized to the zone of
melting, and the signal over the ocean in the
two cases is highly similar. This similarity ex- 
plains the consistency of the corrected obser- 
vational trends across the same two periods 
(Fig. 1, B and E). 
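
For readers more used to mass units, the melt rates quoted above in equivalent GMSL rise can be converted to gigatonnes per year. The sketch below assumes a global ocean area of 3.618 × 10^14 m² and a freshwater density of 1000 kg m⁻³; both are common round values supplied here for illustration and are not taken from the text, and the function name is hypothetical.

```python
OCEAN_AREA_M2 = 3.618e14      # assumed global ocean surface area
WATER_DENSITY_KG_M3 = 1000.0  # assumed density of meltwater
KG_PER_GT = 1.0e12


def gmsl_rate_to_gt_per_year(rate_mm_per_year):
    """Convert a rate of equivalent GMSL rise (mm/year) to mass loss (Gt/year)."""
    volume_m3 = rate_mm_per_year * 1.0e-3 * OCEAN_AREA_M2
    return volume_m3 * WATER_DENSITY_KG_M3 / KG_PER_GT


for label, rate in [("1993-2019", 0.74), ("2002-2019", 1.10)]:
    print(f"{label}: {rate:.2f} mm/year = {gmsl_rate_to_gt_per_year(rate):.0f} Gt/year")
```

Under these assumptions, 1 mm of GMSL rise corresponds to about 362 Gt of water, so the two quoted rates amount to roughly 268 and 398 Gt/year of combined ice mass loss.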

In addition to predicting the impact of mod- 
ern GrIS ice mass loss on SSH change, we per- 
formed a simulation to determine the ongoing 
impact of the last ice age on SSH in the region 
(Fig. 3C). We reserve the term glacial isostatic 
adjustment (GIA) for this signal. The GIA 
calculation was performed using the same 
sea level theory and algorithm as above (34) 
and is based on the paired models of the ICE- 
6G_C ice history from the Last Glacial Maxi- 
mum onward and the VM5a viscosity model 
(37). The prediction based on this model shows 
little structure and varies by only ~0.2 mm/year 
across the entire region. 

The observed SSH trends (Fig. 1, B and E) 
include all signals that contribute to the sea 


GrIS mass balance (19) and model reconstructions of Greenland peripheral and
nearby glaciers [(33); blue shading, corresponding to Fig. 3, A and B, respectively].
The shading accounts for errors in the ice mass history reported in (19, 33).
[(B) and (D)] Predicted SSH rates associated with modern melting of the GrIS (Fig. 3,
A and B) versus observed rates (Fig. 1, B and E). Best-fitting slopes are shown as dashed
black lines. The two fields are sampled within the red box shown in Fig. 1E.


level field, including those associated with 
other areas of ice mass flux, long-term global- 
mean steric effects and terrestrial water stor- 
age. These additional signals are likely to be 
smooth across the North Atlantic region (low 
or zero gradient) (15). Thus, a positive offset 
between the observed trends and our predic- 
tion from recent GrIS (and neighboring) mass 
flux (Fig. 3, A and B) is expected. The detection 
of the latter fingerprint signal in the former 
must therefore be based on regional gradients 
in the signals. In this regard, a correlation be- 
tween the maps of observed SSH trends and 
the predicted trends—both reflecting the hall- 
marks of a fingerprint geometry—is qualita- 
tively apparent. 

A preliminary examination of this correla- 
tion is provided in Fig. 4, A and C, in which 
the observed (blue line with vertical error 
bars) and predicted (blue shaded region) signals
along the single north-south profile
shown in Fig. 1E are directly compared, for
the time windows 1993-2019 and 2002-2019, 
respectively. The predicted signal in both frames 
accounts for the uncertainty in the altimetry- 
derived mass loss history (19, 33). The two signals 
track each other closely over both time periods, 
showing comparable trends toward higher 
rates moving southward along the profiles. 

Next, we extended the above analysis by 
sampling the observed and predicted SSH 
trends (Figs. 1, B and E, and 3, A and B) within 
a geographic region bounded by the red dot- 
ted box in Fig. 1E. The southern and eastern 
borders of the box are constrained to avoid the 
localized SSH anomaly in the subpolar gyre 
region (in the North Atlantic south of Green- 
land and Iceland). Moreover, any grid points 
associated with SSH observations within 30 km 
of the shoreline are omitted because of re- 
duced data reliability in the vicinity of the 
coast (38). Figure 4, B and D, shows scatter- 
plots of the results for time windows 1993-2019 
and 2002-2019, respectively. The best-fitting 
slopes for the observed versus predicted SSH 
trends are 0.89 for the period 1993-2019 and 
0.89 for the period 2002-2019. This analysis 
accounts for the variation in spatial resolution 
as a function of latitude (28) using a weighting 
approach. (As noted above, the resolution 
varies from ~100 km at the southern tip of 
Greenland, decreasing to ~150 km at 66°N 
and 250 km at 72°N. North of 72°N, we assume 
that the resolution remains at 250 km.) Ac- 
counting for errors in both the observed SSH 
trends and the model predictions (see supple- 
mentary materials), the Pearson correlation 
coefficient between the samples is 0.83 ± 0.26
for the 1993-2019 period and 0.77 ± 0.20 for
the 2002-2019 period. Both the best-fit analysis
and the correlation coefficients indicate a
strong and statistically significant linear as- 
sociation (P < 0.001) between the predictions 
and the observations. 
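
The comparison above can be illustrated with a weighted fit on synthetic data. The sketch makes several assumptions that are not in the text: the southern tip of Greenland is placed at 60°N, the resolution is interpolated linearly between the quoted values, each grid point is weighted by the inverse square of the local resolution, and observed trends are regressed on predicted trends. The authors' actual weighting and error treatment are described in their supplementary materials; the function names are hypothetical.

```python
import numpy as np


def resolution_km(lat_deg):
    """Effective altimetry resolution versus latitude, held at 250 km north of 72N."""
    # 60N for the southern tip of Greenland is an assumed round number.
    return np.interp(lat_deg, [60.0, 66.0, 72.0], [100.0, 150.0, 250.0])


def weighted_slope_and_correlation(predicted, observed, weights):
    """Weighted least-squares fit of observed on predicted, plus weighted Pearson r."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    mean_p = np.sum(w * predicted)
    mean_o = np.sum(w * observed)
    cov = np.sum(w * (predicted - mean_p) * (observed - mean_o))
    var_p = np.sum(w * (predicted - mean_p) ** 2)
    var_o = np.sum(w * (observed - mean_o) ** 2)
    slope = cov / var_p
    return slope, mean_o - slope * mean_p, cov / np.sqrt(var_p * var_o)


# Synthetic stand-in for sampled grid points (values in mm/year).
rng = np.random.default_rng(1)
lat = np.linspace(60.0, 78.0, 200)
predicted = -0.2 - 0.05 * (lat - 60.0)
# A slope of 0.89 mimics the reported best fit; the +2.0 offset stands in for
# spatially smooth signals that are not part of the fingerprint prediction.
observed = 0.89 * predicted + 2.0 + 0.05 * rng.standard_normal(lat.size)
weights = 1.0 / resolution_km(lat) ** 2  # assumed weighting scheme

slope, offset, r = weighted_slope_and_correlation(predicted, observed, weights)
print(f"slope = {slope:.2f}, offset = {offset:.2f} mm/year, r = {r:.2f}")
```

Because the fit includes an intercept, a uniform offset between the two fields does not affect the slope or the correlation, which is why detection can rest on regional gradients alone.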

To gain insight into the origin of the scatter 
in Fig. 4, B and D, we computed maps of the 
distance between each point in these figure 
panels and the associated best-fit line (fig. S4). 
The result indicates that departures from the 
best-fit line are relatively small across the 
region, with the exception of sites close to 
the shoreline of Greenland and in the vicinity 
of the subpolar gyre. As noted above, the 
former may arise from the decrease in ac- 
curacy of the altimetry measurements of SSH 
close to shorelines. The latter suggests that a 
residual ocean dynamic signal exists in the 
vicinity of the subpolar gyre after the correc- 
tion based on the maximum covariance anal- 
ysis, particularly for the shorter time span 
2002-2019. 

Recent updates to multimission altimetry 
records (26-28) provide almost three decades 
of observations of SSH trends extending up to 
80°N. This latitude range allows for examination
of the largest amplitudes and gradients of
the sea level fingerprint generated by GrIS 
melt. The analyses herein indicate that, after 
the removal of ocean dynamic variability, the 
geographic variation in the observed SSH 
trends in the oceans encircling Greenland, 
over the periods 1993-2019 and 2002-2019, 
are dominated by the sea level fingerprint pro- 
duced by modern melting of GrIS and nearby 
glacier systems. These analyses provide a stat- 
istically significant detection of the sea level 
fingerprint of modern ice mass flux. Our re- 
sults demonstrate that fingerprint geometries 
are rapidly established within the global oceans, 
they independently affirm the accuracy of 
altimetry-based constraints on SSH variations 
and polar ice mass flux, and they validate the- 
oretical and numerical frameworks for predict- 
ing patterns of gravitationally self-consistent 
sea level change. Furthermore, the approach 
we have described that isolates decadal signals 
by means of a maximum covariance analysis 
regression with wind fields is universally ap- 
plicable and markedly shortens the time scale 
necessary for fingerprint detection by over- 
coming several major obstacles; namely, that a 
steric height correction for ocean dynamic 
variability is inappropriate on shallow shelves 
and temperature and salinity data are gener- 
ally more uncertain in polar regions. The com- 
bination of near-field sea level records with 
this analysis procedure can be applied in 
future efforts to detect sea level fingerprints 
of ice mass flux for all polar ice sheets and 
large-scale glacier systems, thus providing 
important additional constraints on their rela- 
tive contributions to the budget of GMSL and 
on pressing efforts to perform accurate proj- 
ections of the global geometry of sea level 
changes in our progressively warming world.

Hydrophobic polyamide nanofilms provide rapid transport for crude oil separation

Hydrocarbon separation relies on energy-intensive distillation. Membrane technology can offer an
energy-efficient alternative but requires selective differentiation of crude oil molecules with rapid liquid
transport. We synthesized multiblock oligomer amines, which comprised a central amine segment with
two hydrophobic oligomer blocks, and used them to fabricate hydrophobic polyamide nanofilms by
interfacial polymerization from self-assembled vesicles. These polyamide nanofilms provide transport of
hydrophobic liquids more than 100 times faster than that of conventional hydrophilic counterparts.
In the fractionation of light crude oil, manipulation of the film thickness down to ~10 nanometers
achieves permeance one order of magnitude higher than that of current state-of-the-art hydrophobic
membranes while retaining comparable size- and class-based separation. This high permeance can
markedly reduce plant footprint, which expands the potential for using membranes made of ultrathin
nanofilms in crude oil fractionation.


Separation and purification play a crucial role in the oil and gas industry but rely
on energy-intensive distillation processes, which account for 10 to 15% of global
energy use (1). Membrane-based separation has emerged as a promising alternative
because of its innate advantage of low energy consumption. One successful example is
the production of fresh water from seawater by reverse osmosis (2). Thin-film
composite membranes that are manufactured by means of interfacial polymerization show
exceptional water permeance and high salt rejection, which is achieved through their
thin and highly cross-linked polyamide selective layer (2). In search of a similar
paradigm change, organic solvent reverse osmosis has been developed for molecular
separations in organic liquids (3, 4) and applied to the fractionation of crude oil
(5, 6). However, thin-film composite membranes made from conventional chemistry exhibit
limited permeance of hydrocarbon liquids,
owing to the hydrophilic nature of polyamide 
or polyester networks in the separating layer 
(7-9). To enhance the hydrophobicity, attempts 
have been made to cap the polyamide layer 
with hydrophobic groups or introduce hydro- 
phobic groups to the monomers used during 
the interfacial reaction (10-15). The permeance- 
selectivity range of the resulting membranes 
improved, but the permeance remains too low 
for industrial crude separation (5, 6, 16). There- 
fore, creating hydrophobic separating layers 
with high permeances for nonpolar liquids and 
tight selectivities for crude oil fractions remains 
of interest. 

We report the fabrication of ultrathin hy- 
drophobic polyamide nanofilms from vesicles 
of self-assembled multiblock oligomer amines 
(MOAs) by means of interfacial polymerization 
and their performance in hydrocarbon separa- 
tions. Amphiphilic MOAs, which include a fluo- 
rine oligomer series and an alkane oligomer 
series, were synthesized by functionalizing the 
two ends of a hydrophilic precursor, pentaethylenehexamine
(N6), with hydrophobic inert
oligomers containing R-OH (R = I to VI) (Fig. 1A
and fig. S1). The two-step reaction used carbon- 
yldiimidazole as a coupling agent, through which 
we tuned the MOA structure by attaching oligo- 
mers of increasing length. Nuclear magnetic 
resonance and Fourier transform infrared (FTIR) 
spectroscopy confirmed the formation of MOAs, 
including F5N6F5, F9N6F9, F13N6F13, C3N6C3,
C6N6C6, and C10N6C10 (figs. S2 to S8).

MOAs were readily soluble in tetrahydrofuran 
(THF) and formed a clear solution (Fig. 1B). 
Upon the addition of water to reach a 95/5 
water/THF mixture, when present above the 
critical aggregation concentration (CAC), the 
MOAs self-assemble into nanoscale micelles 
or vesicles (Fig. 1B and figs. S9 to S11) because
of their inherent amphiphilicity (17). We conjecture
that the spherical vesicles are composed
of aggregated hydrophobic oligomers facing 
inward and extended hydrophilic amines fac- 
ing outward to the aqueous solution (fig. S12) 
(18). Scanning electron microscope (SEM) and
transmission electron microscope (TEM) images 
were consistent with the conjectured core-shell 
structure (Fig. 1B and figs. S13 and S14). 

Amines on the outer shell of MOA vesicles 
can react with trimesoyl chloride at a free 
aqueous-organic interface to fabricate a highly 
cross-linked polyamide nanofilm (Fig. 1C). The 
nanofilm comprises smooth surfaces formed 
from the extended MOAs, with protuberances 
introduced by the vesicles present above the 
CAC. These nanofilms were flexible and robust, 
with no macroscopic defects observed while 
transferring them onto a water-air surface 
(Fig. 1D). When using more concentrated MOA 
solutions, nanofilms shrank after transfer to
the water surface (fig. S15 and movie S1), 
which is consistent with their hydrophobicity. 
Free-standing nanofilms were then adhered 
onto porous polymeric supports and used as 
composite membranes (fig. S16). Both SEM 
cross-sectional images (Fig. 1E) and atomic 
force microscopy (AFM) images (Fig. 1F) reveal 
the existence of vesicles in the nanofilms, with 
~200-nm diameters (figs. S17 to S19), which 
matches well with the vesicle sizes measured
in solution by using dynamic light scattering 
(fig. S11). 

We hypothesized that the vesicles provide 
enhanced hydrophobicity across the nanofilm 
because of the aggregation of hydrophobic 
segments in MOAs. To verify this hypothesis, 
we produced smooth nanofilms without vesicles
using an F5N6F5 concentration below the
CAC (Fig. 2A), which exhibited a water contact 
angle of ~74° (fig. S20). This increased to ~93° 
for membranes that incorporate vesicles by 
using concentrations above the CAC (Fig. 2B 
and fig. S20), which indicates high hydro- 
phobicity (19) while retaining a similar cross- 
linking degree (fig. S21). TEM images and the 
corresponding element mapping demonstrate 
the fluorine distribution across the nanofilm 
(Fig. 2, C and D). The dispersed fluorine con- 
tent in the smooth regions corresponded to 
nanofilms that were formed by extended 
MOAs, in contrast to concentrated fluorine 
intensity in the vesicles (Fig. 2C), which was further
enhanced with increasing MOA concentration
(figs. S22 and S23). These results reveal
the close proximity of fluorine oligomers inside 
the vesicles. The nanofilm was further stained 
with ruthenium tetroxide, which preferentially 
attaches to aromatic groups in the polyamide 
network (20). This enhances contrast and vis- 
ibility of the polyamide separating layer under 
TEM (Fig. 2, E and F). A smooth nanofilm was 
formed at concentrations below the CAC, 
whereas flattened vesicles were observed in 

Fig. 1. Fabrication of hydrophobic polyamide nanofilms with MOAs.
(A) Synthetic route to MOAs with different oligomers R-OH (R is I to VI) by using
carbonyldiimidazole (CDI) as coupling agent. I = F5, II = F9, III = F13, IV = C3,
V = C6, and VI = C10. (B) Schematic and photographs of extended MOAs in THF,
which formed self-assembled micelles or vesicles by the addition of water.
(Inset) A TEM image of F5N6F5 (0.1 wt %) self-assembled MOAs with core-shell
structure. (C) Fabrication of ultrathin polyamide nanofilms by interfacial
polymerization at a free interface between a hexane phase containing trimesoyl
chloride and an aqueous phase (95 wt % water and 5 wt % THF) containing
MOAs. (D) Photograph of a free-standing nanofilm transferred from the free
interface onto a water-air surface. (E) Cross-sectional SEM image of the ultrathin
nanofilm transferred onto an alumina support. (F) AFM height image of
the ultrathin nanofilm transferred onto a silicon wafer. Both images reveal the
incorporation of MOA vesicles into the nanofilms.


nanofilms formed at MOA concentrations 
above the CAC. The high contrast across the 
vesicle surfaces reveals that the amines on 
the outer shell of vesicles reacted to form 
continuous cross-linked polyamide films during 
interfacial polymerization (Fig. 2F), which is 
chemically homogeneous with the smooth poly- 
amide layer formed from extended MOAs. Fur- 
thermore, the interiors of the vesicles showed 
no signs of staining, which suggests a hollow
structure beneath the ~30-nm-thick separat- 
ing layer, despite the overall thickness of the 
vesicles being >100 nm. These vesicles were 
mechanically robust and conserved their 
structures after being pressurized at 10 bar 
(figs. S24 and S25). AFM images demonstrated
that increasing the length of fluorine chains
from F5N6F5 to F13N6F13 in MOA monomers led
to a corresponding increase in the nanofilm
thickness (Fig. 2, G and H, and figs. S26 to S28). 

We hypothesized that the close proximity of 
hydrophobic chains in these vesicles provides 
rapid transport pathways for nonpolar liq- 
uids through the nanofilm. Further, MOA 
membranes could be activated by acetone to 
increase permeance. We assert that acetone 
activation rearranges the hydrophobic blocks 

Fig. 2. Characterization of hydrophobic polyamide nanofilms fabricated
from MOAs. (A and B) SEM surface images of MOA nanofilms fabricated at (A)
F5N6F5 concentration (0.005 wt %) below CAC and (B) F5N6F5 concentration
(0.02 wt %) above CAC. The protuberances correspond to the vesicles of self-
assembled MOAs. (C) TEM images of nanofilms made by using 0.02 wt % F9N6F9
and corresponding fluorine mapping from energy-dispersive x-ray (EDX) analysis
(inset). (D) High-angle annular dark-field scanning TEM (HAADF-STEM) cross-
sectional images of the nanofilm made by using 0.1 wt % F5N6F5 and its
EDX fluorine mapping (inset). PAN, polyacrylonitrile. (E and F) TEM cross-
sectional images of the nanofilm made from C6N6C6 at (E) low concentration
(0.005 wt %) and (F) high concentration (0.1 wt %), in which the nanofilms
were stained with ruthenium tetroxide to provide contrast for aromatic groups
present in polyamide networks. The dense stripes in (F) reveal the active
thickness of the nanofilm with hollow channels beneath. (G and H) AFM
height (G) image and (H) profiles of F5N6F5 (0.02 wt %), F9N6F9 (0.02 wt %),
and F13N6F13 (0.025 wt %) nanofilms transferred onto silicon wafers for the
measurement of nanofilm thickness. The profile lines across the protuberances
provide width and height of MOA vesicles. Scale bar, 400 nm.


so that some of the initially inward-facing 
hydrophobic tails inside the vesicles extend 
from the outer surface (fig. S12). The activated 
F9N6F9 membrane provided hexane permeance
an order of magnitude higher than that of
the membrane before activation (Fig. 3A). This
corresponded to an increase of the water con- 
tact angle from 82° to 98° after activation (Fig. 
3A inset), which is consistent with exposure of
the hydrophobic blocks (17). The net effect is 
creation of hydrophobic expressways for non- 
polar solvents, which is similar to the ultrafast 
water transport through aquaporin membranes 
(21). Although conventional polyamide mem- 
branes exhibited enhanced permeance for all 
solvents after activation by dimethylformamide
(DMF), MOA membranes showed a significant
enhancement only for nonpolar solvents (figs.
S29 and S30). Moreover, MOA membranes
after DMF activation exhibited a relative increase
in permeance approximately eight times
lower than that of those activated by acetone (fig. S31).
Seeking to further explain this phenomenon, 
we examined the surface morphology, thickness, 
and elemental composition of nanofilms before
and after solvent activation (figs. S19, S21, and
S28) and observed no notable differences. Therefore,
we attribute the permeance boost to an
increase in hydrophobicity that arises from
the rearrangement of hydrophobic blocks at a
molecular level upon acetone exposure.

To demonstrate the impact of hydrophobic
oligomers, we used the precursor amine
without oligomers, pentaethylenehexamine
(N6), to prepare hydrophilic polyamide nanofilms
under identical conditions. This nanofilm
was smooth, with no vesicles observed on
the surface (fig. S19). Composite membranes
comprising N6 nanofilms showed no permeance
increase after activation (fig. S32), exhibiting
negligible permeances for nonpolar
solvents, but showed high permeance for polar
solvents, including water (Fig. 3B and table S1).
By contrast, membranes that incorporated
MOA vesicles reversed this trend, demonstrating
fast transport for nonpolar solvents
(Fig. 3B and fig. S33). For example, the hexane
permeance reached 63.9 ± 5.6 liters m⁻² hour⁻¹
bar⁻¹, which is >100 times higher than that
of N6 membranes and other conventional hydrophilic
polyamide membranes (9). This is
consistent with the enhanced hydrophobicity
of the MOA nanofilm, for which the water
contact angle increased from 42° for the N6
membrane to 93° for the F5N6F5 membrane
(Fig. 3B, inset).

Fig. 3. Performance of nanofilms made from MOAs on PAN supports.
(A) Hexane permeance for membranes made from 0.02 wt % F9N6F9 before
and after acetone activation. (Inset) The corresponding water contact angle.
(B) Plot of solvent permeances through membranes made from 0.02 wt %
F5N6F5 and its precursor amine, pentaethylenehexamine (N6), against the
solvent viscosity, with both membranes being tested after acetone activation.
(Inset) The water contact angle for both membranes. (C) Nonpolar solvent
permeances including hexane, heptane, and toluene for F9N6F9 membranes
with increasing MOA concentrations. (D) Plot of heptane permeance against
time for F5N6F5, F9N6F9, and F13N6F13 membranes operating in a continuous
crossflow rig. (E) Comparison of heptane permeance and molecular weight
cut-offs for polystyrene markers between MOA membranes (this work)
and membranes reported in the literature. (F) Toluene permeability
of membranes made from fluorine series (F5N6F5, F9N6F9, and F13N6F13) and
alkane series (C3N6C3, C6N6C6, and C10N6C10) of MOAs.
MWCO, molecular weight cut-off;
PDMS, polydimethylsiloxane; PIM-1, polymers of intrinsic microporosity-1.
The error bars represent the standard deviation, which was calculated based on
the results from three separate membranes.

To further evaluate the impact of vesicles, 
we prepared nanofilms with varying MOA con- 
centrations (Fig. 3C and fig. S34). Below the 
CAC, no vesicles were formed in the aqueous 
phase, and the surface of the resulting nano- 
films was relatively smooth (fig. S17) because the 
heat from the vigorous cross-linking reaction 
was efficiently dissipated through the bulk 
aqueous solution at the free interface (22). 
These pre-CAC membranes exhibited limited 
permeance increase after activation (fig. S35), 
with toluene permeance of 1.2 ± 0.3 liters
m⁻² hour⁻¹ bar⁻¹ recorded. This increased to

Fig. 4. Separation of synthetic and real crude oil by using nanofilm
composite membranes made from MOAs. (A) Ratio of concentrations in
permeate (Cp) versus retentate (Cr) of components in a synthetic crude
oil fractionated by C3N6C3 membrane and commercial benchmark, ONf-2
membrane (40 bar, 22°C, cross-flow conditions). Molecules above the reference
dashed line were more concentrated in the permeate as compared with the feed,
whereas the molecules below the reference dashed line were more concentrated in
the retentate. (B) Comparison between membranes comprising MOAs (F5N6F5,
F9N6F9, F13N6F13, C3N6C3, C6N6C6, and C10N6C10) and commercial membranes
tested under the same conditions (40 bar, 22°C, cross-flow conditions), measured
as Cp/Cr for iso-cetane (226.5 g mol⁻¹) and synthetic crude oil permeance.
(C) Permeance of real light shale-based crude oil for F13N6F13 membrane at 30°C
and SBAD-1 membrane at 130°C, adapted from (5). (D) Boiling-point distribution
of feed, permeate, and retentate from F13N6F13 membrane fractionation of light
shale-based crude oil. (Inset) Permeate (left) and retentate (right) solutions.
(E) Comparison of Cp/Cr extracted from experimental simulated distillation curve
of permeate and retentate to model predictions for C6N6C6 membranes and
F13N6F13 membranes.


14.8 ± 1.2 liters m⁻² hour⁻¹ bar⁻¹ for membranes
incorporating vesicles made by using
MOA concentrations above the CAC (Fig. 3C),
despite thinner nanofilms and a similar cross-
linking degree for the pre-CAC membranes
(figs. S21 and S27). Although the permeance
notably increased, the rejection performance
remained unchanged (fig. S36).
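
The size of the vesicle effect can be checked directly from the two toluene permeances quoted above. The short Python sketch below computes the fold increase. As an assumption of the sketch, not a step taken in the text, it also propagates the quoted standard deviations as if the two measurements were independent.

```python
import math


def fold_increase(before, after, sd_before=0.0, sd_after=0.0):
    """Return (ratio, sd_ratio) for after / before.

    The uncertainty uses first-order error propagation and assumes the
    two measurements are independent (an assumption of this sketch).
    """
    if before <= 0 or after <= 0:
        raise ValueError("permeances must be positive")
    ratio = after / before
    relative_sd = math.sqrt((sd_before / before) ** 2 + (sd_after / after) ** 2)
    return ratio, ratio * relative_sd


# Toluene permeances from the text, in liters m^-2 hour^-1 bar^-1.
ratio, sd = fold_increase(1.2, 14.8, sd_before=0.3, sd_after=1.2)
print(f"fold increase: {ratio:.1f} +/- {sd:.1f}")
```

Most of the propagated uncertainty comes from the small pre-CAC value, so the ratio is best read as roughly an order of magnitude and not as a precise figure.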

One advantage of MOA membranes is the 
potential to manipulate their performance 
by tuning the oligomer length or chemistry. 
Heptane permeance increased as the fluorine
chain length shortened from F13N6F13 to F5N6F5
(Fig. 3D) because of the thinner separating
layer created (Fig. 2H). The permeances remained
constant over 48 hours, which indicates
no collapse of vesicles under 10 bar applied
pressure (fig. S37). Polystyrene oligomers and 
dimer were dissolved in heptane to investigate 
their separation performance. The molecular 
weight cut-offs ranged from 395 to 450 g mol⁻¹,
which is consistent with their dye rejections in 
methanol (fig. S38 and table S2). We plotted 
a trade-off between heptane permeance and 
molecular weight cut-off of polystyrene to 
compare the performance among MOA mem- 
branes, the commercial benchmarks ONf-2 
and PuraMem 6008S, and other membranes 
reported in literature (Fig. 3E and table S3) 
(12, 23-26). MOA membranes exhibited separation
performance well beyond the upper
bound.

By tuning the chemistry of oligomers, com- 
posite membranes comprising a series of alkane 
MOA nanofilms were fabricated. The toluene 
permeability for MOA membranes comprising 
both fluorine and alkane series is shown in 
Fig. 3F. Within the same series, the toluene 
permeability remained unaltered, independent 
of the nanofilm thickness (Fig. 2H and fig. S27). 
Across different series, membranes made from 
the alkane MOAs provided more than double 
the permeability of the fluorine MOAs with- 
out compromising the rejection (fig. S38). 
This illustrates that the oligomer chemistry 
in MOAs can tailor the permeance and permeability
of the resulting membranes, which
suggests that MOA membranes with desig- 
nable oligomer functionality could be devel- 
oped for dedicated separation tasks (27). 
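
The distinction between permeance and permeability drawn above can be illustrated with a small calculation. The sketch below takes permeability as permeance multiplied by the separating-layer thickness, which is the usual thickness normalization. The film values are hypothetical and chosen only to show two films of different thickness sharing one permeability.

```python
def permeability(permeance, thickness_nm):
    """Thickness-normalized transport: permeance x separating-layer thickness.

    permeance is in liters m^-2 hour^-1 bar^-1 and thickness in nm, so the
    result carries the units liters m^-2 hour^-1 bar^-1 nm.
    """
    if permeance < 0 or thickness_nm <= 0:
        raise ValueError("permeance must be >= 0 and thickness must be > 0")
    return permeance * thickness_nm


# Hypothetical (permeance, thickness) pairs, for illustration only.
films = {"thin film": (30.0, 10.0), "thick film": (10.0, 30.0)}
values = {name: permeability(p, t) for name, (p, t) in films.items()}

for name, value in values.items():
    print(f"{name}: {value:.0f} liters m^-2 hour^-1 bar^-1 nm")
```

In this picture, a thinner film of the same material raises permeance but leaves permeability unchanged, whereas a change in oligomer chemistry shifts the permeability itself.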

The ultrafast transport of nonpolar solvents 
and the material tunability make MOA mem- 
branes a promising candidate for fractiona- 
tion of small hydrocarbon liquid molecules. 
Challenging the MOA membranes with a syn- 
thetic crude oil composed of a group of hy- 
drocarbon molecules in a cross-flow filtration 
system (fig. S39 and table S4), we found that 
they demonstrated good separation of mole- 
cules on the basis of differences in size and class 
(Fig. 4A). In contrast to conventional binary 
separation systems such as polystyrenes in hep- 
tane, there is no designated solvent or solute 
in complex crude oil mixture fractionation. 
Thus, the membrane selectivity is presented 
as the ratio of permeate to retentate concen- 
trations (C,/C,) for each component plotted 
against the molecular weight of the compo- 
nents. After >24 hours of continuous operation, 
the membranes made from MOAs, especially 
C3N6C3 membranes, demonstrated much better
size- and class-based selectivity than
the ONf-2 membrane, which is
the current commercial benchmark used for 
processing hydrocarbon liquids (Fig. 4A, fig. 
S40, and table S5). We speculate that the flexible
alkane chains of C3N6C3 in the membrane
have a strong interaction with “structure-like” 
hydrocarbon molecules, which results in the 
retention of the larger species (5, 27). More- 
over, when elevating the system temperature 
from 22°C to 50°C, C3N6C3 membranes displayed
good thermal stability and increased
permeance without compromising the selec- 
tivity, whereas ONf-2 membranes showed de- 
creasing selectivity at high temperature (fig. 
S41). The trade-off between permeance and 
selectivity of the largest molecule, isocetane, 
for membranes tested with the synthetic crude 
oil is shown in Fig. 4B. The performance of 
most MOA membranes exceeded the upper 
bound compared with that of commercial 
membranes (table S6). This is due to the ultra- 
thin MOA nanofilms formed through inter- 
facial polymerization (22), which enable faster 
transport than that of commercial membranes 
with thick separating layers formed through 
phase inversion (7). 
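
The Cp/Cr selectivity measure described above can be sketched in a few lines of Python. Only the isocetane molecular weight is taken from the text. The other components and all concentrations are hypothetical placeholders, and the reference line is assumed to sit at Cp/Cr = 1.

```python
from dataclasses import dataclass


@dataclass
class Component:
    name: str
    mol_weight: float   # g/mol
    c_permeate: float   # concentration in permeate (any consistent unit)
    c_retentate: float  # concentration in retentate (same unit)

    @property
    def ratio(self):
        """Cp/Cr for this component."""
        return self.c_permeate / self.c_retentate


def classify(components, reference=1.0):
    """Sort by molecular weight and report where each component is enriched.

    A ratio exactly equal to the reference is reported as 'retentate'.
    """
    rows = []
    for c in sorted(components, key=lambda c: c.mol_weight):
        side = "permeate" if c.ratio > reference else "retentate"
        rows.append((c.name, c.mol_weight, round(c.ratio, 2), side))
    return rows


# Illustrative values only; not measured data.
mixture = [
    Component("isocetane", 226.5, 2.0, 4.0),
    Component("toluene", 92.1, 12.0, 10.0),
    Component("n-octane", 114.2, 11.0, 10.0),
]
rows = classify(mixture)
for row in rows:
    print(row)
```

Plotting the ratio column against the molecular weight column gives the kind of curve the text describes, with a steeper fall in Cp/Cr indicating sharper size-based selectivity.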

MOA membranes were further challenged 
with a real light shale-based crude oil. For 
this process, the SBAD-1 membrane is the 
current state-of-the-art membrane, with the 
highest permeance reported in literature 
(5, 6). The permeance of the F13N6F13 membrane
was more than five times higher than
that of the SBAD-1 membrane, even when operating at
room temperature (30°C), whereas a high temperature
(130°C) was needed for the SBAD-1
membrane to achieve comparable selectivity
(Fig. 4C) (5). This is attributed to the ultrathin
nanofilms fabricated by means of
interfacial polymerization, whereas hundreds- 
of-nanometers-thick active layers were formed 
in the SBAD-1 membrane through the spin 
coating technique. The enhanced liquid per- 
meance can substantially reduce the required 
membrane area, plant footprint, and hence 
the capital cost for any specific refinery capac- 
ity (6). The boiling point distribution of the 
feed, permeate, and retentate from the light 
shale-based crude is shown in Fig. 4D. The 
permeate has a cutoff at 550°C, whereas the 
feed and retentate boiling point range ex- 
tends to 700°C. This demonstrates that the 
membrane rejects the heavy (residue) frac- 
tion of the crude oil. At the same time, the per- 
meate was found to be appreciably enriched 
in the lighter molecules present in the feed 
so that >60 weight % (wt %) of the permeate 
had a boiling point below 400°C compared 
with 48 wt % observed in the feed. A two- 
dimensional gas chromatography (GC×GC)
difference plot for the permeate and reten- 
tate samples further demonstrated that the 
permeate was lighter than the retentate, with 
high concentrations of light saturates and light 
aromatic molecules in the permeate, whereas 
there is rejection of heavier three- and four- 
ring aromatics (fig. S42). These results high- 
light the capabilities of MOA membranes in 
crude oil fractionation, providing enrichment 
of high-value light molecules and rejection of 
low-value heavy molecules while exhibiting the 
high permeances that industrial application 
demands. To demonstrate the scalability of this 
approach, conventional interfacial polymeri- 
zation was carried out to fabricate in situ MOA 
membranes directly on the support (fig. S43).
They showed satisfactory separation performance
in synthetic crude oil (fig. S44), which rein- 
forces the potential for using these materials 
in large-scale industrial applications. 
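
The link between permeance and membrane area made above can be made concrete with a rough sizing estimate. The sketch assumes flux scales linearly with applied pressure and ignores osmotic pressure, concentration polarization, and fouling. The flow rate, pressure, and permeances are hypothetical.

```python
def required_area_m2(flow_l_per_h, permeance, pressure_bar):
    """Membrane area (m^2) needed to pass a given permeate flow.

    permeance is in liters m^-2 hour^-1 bar^-1. Flux is taken as
    permeance x pressure, a simplification made for this sketch.
    """
    if min(flow_l_per_h, permeance, pressure_bar) <= 0:
        raise ValueError("all inputs must be positive")
    return flow_l_per_h / (permeance * pressure_bar)


# Hypothetical duty: 10,000 liters/hour of permeate at 40 bar.
baseline = required_area_m2(10_000.0, permeance=1.0, pressure_bar=40.0)
improved = required_area_m2(10_000.0, permeance=5.0, pressure_bar=40.0)

print(f"baseline area: {baseline:.0f} m^2")
print(f"area at 5x permeance: {improved:.0f} m^2")
```

Under these assumptions, a fivefold gain in permeance cuts the required area to one-fifth at the same duty.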

To understand the origin of the high selec- 
tivity by using MOA membranes in hydrocar- 
bon separation systems, a modeling approach 
based on diffusion theory is proposed in the 
supplementary text, section 3. The model al- 
lows for the representation of membrane-based 
fractionation of crude oil on the basis of a sin- 
gle parameter, and it was applied to separa- 
tions by using fluorine and alkane oligomers 
(tables S7 and S8). The model gives a good fit
of the light shale-based crude oil separation
data with the experimental results for the fluorine
series (F13N6F13 membranes) and alkane
series (C6N6C6 membranes), both having a
crossover at molecular weight of ~200 g mol⁻¹
(Fig. 4E). The F13N6F13 membranes show a better
retention for large hydrocarbon molecules
than that of the C6N6C6 membranes, which is
consistent with the permeate boiling point cutoff
results at 10% stage cut, 550°C for F13N6F13
membranes and 590°C for C6N6C6 membranes
(fig. S45 and table S8). The increased retention
of heavy species with F13N6F13 membranes results
from the low solubility of hydrocarbons in
fluoropolymers (28).

By using MOAs for interfacial polymeriza- 
tion, we have fabricated ultrathin hydrophobic 
polyamide nanofilm membranes that incor- 
porate vesicles from the self-assembled MOAs. 
In processing hydrocarbon liquids, these mem- 
branes are more permeable, selective, and stable 
than commercially available and other literature- 
reported membranes. We also highlight the 
feasibility of manipulating the crude oil sep- 
aration performance at the molecular level by 
introducing oligomers with different chemis- 
tries into the MOA molecules. This work facil- 
itates the development of the next-generation 
hydrophobic nanofilms created through inter- 
facial polymerization by incorporating rational 
molecular-level design and with the potential 
for scaling up. 

POLYMER CHEMISTRY

Catalytic deconstruction of waste polyethylene with ethylene to form propylene

Richard J. Conk, Steven Hanna, Jake X. Shi, Ji Yang, Nicodemo R. Ciccia, Liang Qi,
Brandon J. Bloomer, Steffen Heuvel, Tyler Wills, Ji Su, Alexis T. Bell, John F. Hartwig


The conversion of polyolefins to monomers would create a valuable carbon feedstock from the largest 
fraction of waste plastic. However, breakdown of the main chains in these polymers requires the 
cleavage of carbon-carbon bonds that tend to resist selective chemical transformations. Here, we report 
the production of propylene by partial dehydrogenation of polyethylene and tandem isomerizing 
ethenolysis of the desaturated chain. Dehydrogenation of high-density polyethylene with either an 
iridium-pincer complex or platinum/zinc supported on silica as catalysts yielded dehydrogenated 
material containing up to 3.2% internal olefins; the combination of a second-generation Hoveyda-Grubbs 
metathesis catalyst and [PdP(tBu)3(μ-Br)]2 as an isomerization catalyst selectively degraded this
unsaturated polymer to propylene in yields exceeding 80%. These results show promise for the 
application of mild catalysis to deconstruct otherwise stable polyolefins. 


The mechanical durability and chemical
inertness of commercial plastics have
led to their widespread use in everyday
life, but these same properties make
them difficult to recycle. Polyolefins,
which account for 36% of global plastic production
(1), are particularly durable and inert,
and only a small proportion of polyethylene 
(PE) waste is recovered by recycling (~14% in 
2015) (2). Polyolefins are prepared with dis- 
tinct molecular weights, molecular weight 
distributions, and microstructures for spe- 
cific applications. Thus, a mixture of polyolefins 
individually produced for diverse applications 
lacks the properties of any one component; 
moreover, accumulated additives and contam- 
inants, as well as chain scission induced by 
mechanical recycling, can degrade performance 
(2). Given the social, economic, and ecological 
threats from the accumulation of mixed poly- 
olefin waste, innovative approaches to use this 
material as a carbon feedstock are needed (2-4). 
Our group (5) and others (6-8) have been 
investigating catalytic chemistry that could 
occur with these polymers that contain only
alkyl C-H and C-C bonds. Because the C-H 
bonds are more reactive than the C-C bonds, 
many efforts have focused on the introduction 
of functional groups onto the polymer chain to 
tune the properties of the plastic (5, 7, 8). More 
challenging are methods for “chemical recy- 
cling” of polyolefins, which entails the cleav- 
age of polymer chains to regenerate monomers 
that can be repolymerized (6, 9). Whereas 
processes are well established to cleave the 
C-O bonds of polyesters (10), methods to 
cleave the alkyl C-C bonds of PE are more 
limited. Pyrolysis (4), catalytic cracking (11),
hydrogenolysis (12-15), tandem hydrogenolysis-
aromatization (16), and alkane metathesis
(17, 18) have been reported for cleavage of 
PE chains. However, these processes form 
complex mixtures of hydrocarbons that are 
difficult to separate and have varying levels of 
utility (Fig. 1A). A process is needed by which 
PE undergoes depolymerization to form a 
single product for which the demand ap- 
proaches the supply of PE waste. A method 
for the selective production of a commodity 
olefin from PE could thus render PE a val- 
uable hydrocarbon feedstock. 

We envisioned a cascade of catalytic pro- 
cesses that would introduce unsaturation to 
a PE chain and then unravel the chain to 
propylene. More specifically, dehydrogenation
could be used to enable subsequent C-C bond
cleavage by a combination of olefin metathesis 
and isomerization to form propylene (Fig. 1B) 
(19, 20). We show that the combination of de- 
hydrogenation of high-molecular-weight PE 
having high density (HDPE) or low density 
(LDPE) and subsequent isomerizing ethenolysis 
(I/E) of the resulting crude polymer containing
internal alkenes converts PE with commer- 
cially relevant molecular weights to propene 
in yields as high as 80%. This dehydrogenation 
and isomerizing ethenolysis (DIE) process was 
successfully tested on both postconsumer and 
virgin PE dehydrogenated by either homoge- 
neous or heterogeneous catalysts. 
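
The carbon bookkeeping behind the isomerizing ethenolysis step can be sketched as follows. This is an idealized model, not the authors' kinetic scheme. It assumes every turnover isomerizes the terminal olefin to the 2-position and that ethenolysis then cleaves there, releasing one propylene and a terminal olefin one carbon shorter. Side reactions are ignored.

```python
def isomerizing_ethenolysis(n_carbons):
    """Idealized I/E of a linear C_n mono-olefin (n >= 4).

    Returns (ethylene_consumed, propylene_formed).
    """
    if n_carbons < 4:
        raise ValueError("need at least a C4 olefin")
    ethylene = 0
    propylene = 0
    chain = n_carbons
    while chain > 3:
        # Isomerize to the 2-alkene, then cleave with ethylene:
        # C_n + C2 -> C3 + C_(n-1)
        ethylene += 1
        propylene += 1
        chain -= 1
    propylene += 1  # the remaining C3 fragment is itself propylene
    return ethylene, propylene


eth, prop = isomerizing_ethenolysis(18)  # 1-octadecene as a model chain
print(f"ethylene consumed: {eth}, propylene formed: {prop}")
# Carbon balance: carbons in the chain plus ethylene equal carbons in propylene.
assert 18 + 2 * eth == 3 * prop
```

In this idealized limit a C_n olefin consumes n - 3 equivalents of ethylene and yields n - 2 equivalents of propylene, so two-thirds of the product carbon comes from ethylene for long chains.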

To determine whether the conversion of PE 
to propylene could be achieved by a cascade 
comprising DIE, a reliable dehydrogenation 
reaction was first needed. The formation and 
isolation of desaturated PE by catalytic de- 
hydrogenation has been reported only with 
materials of low molecular weight [number-
average molecular weight (Mn) = 606] to
ensure solubility (21, 22). Even a recently reported
combination of stoichiometric halogenation
and elimination was conducted with
material with an Mn below 6.6 kDa (18). Many
alkane dehydrogenation catalysts have been 
reported since these initial reports, and we 
investigated those containing phosphinous 
acid, bis(1,1-dimethylethyl)-, 5-methoxy-1,3- 
phenylene ester, which is a symmetric phosphinite
pincer ligand containing an aryl backbone
known as tBuPOCOP, and related pincer ligands
(23). Dehydrogenations of virgin HDPE
and LDPE with more commercially relevant
molecular weights (Mn = 26.1 kDa, Đ = 3.31
and Mn = 35.1 kDa, Đ = 6.54, respectively)
catalyzed by Ir-tBuPOCOP with tert-butyl
ethylene (TBE) as the hydrogen acceptor in
p-xylene solvent yielded unsaturated polymer
containing exclusively internal olefins (<0.1%
terminal and up to 3.2% internal), along with
small amounts (<0.75%) of alkylarenes (Fig. 2A),
as determined by ¹H nuclear magnetic resonance
spectroscopy. The isomerization of terminal
olefins (which might be the kinetic
product) to internal olefins is well known 
from dehydrogenation of liquid alkanes with 
this catalyst (24). The 3.2% of unsaturated 
units corresponds to a modest 7.6 turnovers 
but was limited by consumption of the acceptor 
and generated sufficient alkene units for the
subsequent steps of the DIE process. Analysis
of the molecular weight by gel-permeation
chromatography revealed a decrease in Mn
of the HDPE from 26.1 to 18.2 kDa upon
dehydrogenation, but this change was not 
detrimental to our envisioned degradation to 
propylene. 
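
The turnover figure quoted above can be reproduced from the degree of unsaturation and the 0.42 mol % Ir loading listed in the Fig. 2 caption. The sketch assumes both percentages are expressed relative to the same basis, which the text does not state explicitly.

```python
def turnover_number(product_percent, catalyst_mol_percent):
    """Moles of product formed per mole of catalyst.

    Both inputs must be percentages on the same basis (assumed here to
    be the polymer repeat units).
    """
    if catalyst_mol_percent <= 0:
        raise ValueError("catalyst loading must be positive")
    return product_percent / catalyst_mol_percent


ton = turnover_number(3.2, 0.42)
print(f"turnovers: {ton:.1f}")
```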

To determine whether heterogeneous cat- 
alysts also could desaturate polyolefins, we 
investigated the dehydrogenation of PE with 
catalysts inspired by those in the Oleflex process 
of Honeywell UOP (formerly Universal Oil 
Products) for the dehydrogenation of long-chain 
paraffins (25). We tested the dehydrogenation 
of HDPE with established Pt/Sn (25) and a
new type of heterogeneously supported Pt/Zn 
catalyst (26). Heating of the polymer melt over 
such catalysts under a flow of Ar at 350°C 
furnished unsaturated PE containing up to 
0.6% olefins, along with ~0.25% arenes (Fig. 
2B). Again, just a slight reduction in Mn was
observed (from 26.1 to 21.7 kDa), and all of 
the olefin units were internal. 

Having demonstrated the feasibility of the 
dehydrogenation of high-molecular-weight PE, 
we synthesized a series of dehydrogenated PEs 
with varying levels of unsaturation. To do so, 
we subjected laboratory-grade HDPE to dehydrogenation
with Ir-tBuPOCOP and varying
loadings of TBE as the sacrificial H2 acceptor
(see table S2 for full details). These reactions 
produced PE with degrees of unsaturation 
ranging from 0.6 to 3.2%. 

It was unclear whether high-molecular- 
weight, unsaturated PE would undergo ethe- 
nolysis to form long-chain terminal alkenes 
for subsequent I/E. Subjecting the unsaturated 
HDPE containing 1.09% olefin units to ethenolysis 
with 3.6 mol % of second-generation Hoveyda- 
Grubbs catalyst M1 resulted in a reduction 
of the Mn from 18.1 to 1.6 kDa. The Mn of the
cleaved product corresponds to an average 
chain length of 114 carbons (Fig. 2C). To model 
the process of dehydrogenation and ethenolysis 
on a pure reactant and to gain information on 
the position of the alkene units and the resulting
chain lengths, we conducted these reactions
on tetracosane (C24H50), a pure, long-chain
alkane. Initial dehydrogenation of tetracosane 
with 1 equivalent of TBE converted >95% of the 
acceptor to tert-butyl ethane and converted 
~15% of the alkane to internal tetracosenes. 
Subsequent ethenolysis of the reaction mix- 
ture catalyzed by M1 under 25 bar of ethylene 
formed the distribution of olefins, including a 
wide range of chain lengths shown in Fig. 2D. 
This result indicates that dehydrogenation of 
this long alkane with Ir-tBuPOCOP produces 
predominantly internal alkene, rather than 
terminal alkene, units. 
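
The average chain length quoted for the ethenolysis product follows from dividing the molecular weight by the mass of one CH2 unit. A minimal sketch of that arithmetic, assuming the chain is treated as pure (CH2)n with end groups ignored (the function name is illustrative and not taken from the study):

```python
# Mass of one CH2 repeat unit in g/mol (C = 12.011, H = 1.008).
CH2_MASS = 12.011 + 2 * 1.008


def average_chain_length(molecular_weight_da: float) -> float:
    """Approximate number of carbons in a polyethylene chain of the given mass.

    Assumption: the chain is pure (CH2)n; end groups are ignored.
    """
    return molecular_weight_da / CH2_MASS


if __name__ == "__main__":
    # 1.6 kDa is the molecular weight reported after ethenolysis.
    print(round(average_chain_length(1600.0)))
    # 18.1 kDa is the molecular weight reported before cleavage.
    print(round(average_chain_length(18100.0)))
```

For 1.6 kDa this gives about 114 carbons, in line with the chain length stated in the text.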

Having shown that we could generate long-chain
alkenes from PE, we investigated conditions
for I/E of the resulting chains. I/E of
Fig. 1. Strategies for the valorization of PE waste. (A) Degradation of PE by alkane metathesis with
n-hexane. (B) Conversion of PE to alkyl aromatics by tandem hydrogenolysis/aromatization. (C) Degradation
of PE by hydrogenolysis. (D) Conversion of PE to propylene by DIE (where x is the number of I/E turnovers).
Mes, mesityl (2,4,6-trimethylphenyl).


hydrocarbons is known for the conversion of 
unsaturated alkylarenes to vinylarenes and 
for the conversion of 1-octadecene to propylene 
at low conversions. We sought an I/E that 
would transform the long-chain PE fragments 
to propylene in high yield by first seeking 
conditions for high conversion of 1-octadecene 
(1a) (19). Several combinations of complexes 
known to catalyze either olefin metathesis 
or isomerization were tested, and the results 
are summarized in table S1. The reaction of 
1-octadecene with ethylene catalyzed by the 
combination of Hoveyda-Grubbs catalyst M1
(6 mol % relative to alkene chains, 0.67 mol %
relative to ethylene subunits) and the dimeric
Pd(I) bromide catalyst I1 (3 mol %, 0.33 mol %
relative to ethylene subunits) at 60°C for 16 hours 
formed propylene in 95% yield. 

Catalyst systems created from other combinations
of individual metathesis and isomerization
catalysts led to lower yields of propylene
than those with M1 and I1. Systems containing
three alternative metathesis catalysts (M2 to
M4 in table S1) with I1 formed <60% propylene.
Those containing the combination of M1 and
one of three other isomerization catalysts (I2 to
I4 in table S1) formed <10% propylene. All
of the isomerization catalysts furnished
exclusively internal octadecenes from 1-octadecene
in high or quantitative yield after 16 hours at
60°C in p-xylene, implying that incompatibility
of the isomerization catalysts with the metathesis
catalyst or with ethylene is the cause of the
lower yields of propylene with alternative
isomerization catalysts I2 to I4 (see table S1
for details).

Fig. 2. Conditions tested for DIE. (A) Homogeneous dehydrogenation of PE: 0.42 mol % Ir-tBuPOCOP
(9.8 wt %), 0.46 mol % NaOtBu, 0.6 M in PE; reactions were run for 12 hours with 0.4 equivalents of TBE.
Mol %, equivalents of TBE, and M of PE were calculated relative to PE monomer units. (B) HDPE (molecular weight = 26.1 kDa)
with bimetallic catalysts supported on alumina. Conditions: 20 wt % catalyst, 350°C. Catalysts were first
activated under H2 flow for 1 hour, and then dehydrogenation was conducted for 16 hours under Ar flow.
(C) Gel-permeation chromatogram of HDPE before and after ethenolysis with M1. Conditions for ethenolysis
were 0.36 M in unsaturated PE, 3.6 mol % M1 in p-xylene (relative to PE monomer units), heated at 130°C for
16 hours under 25 bar ethylene. (D) Sequential dehydrogenation and ethenolysis of a long-chain paraffin
(tetracosane). Initial dehydrogenation yields olefins in 10% yield, and subsequent ethenolysis with M1 yields a
statistical distribution of shorter-chain olefins.

The free energy of isomerizing ethenol- 
ysis is estimated to be negative but small 
(-1.52 kcal/mol of propylene formed under 
standard conditions; see the supplementary 
materials for details). Thus, our standard re- 
actions that gave 95% yield of propylene were 
conducted in a pressure vessel with sufficient 
headspace to provide a large excess (64 equiv- 
alents per methylene unit) of ethylene to favor 
full consumption of the alkene chain. Con- 
sistent with the estimated thermodynamics, 
reactions with a smaller headspace providing 
just 3.5 equivalents of ethylene in the system 
also led to a high, albeit slightly lower, 72 ± 6%
yield of propylene. 
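
To see why a small negative free energy still calls for excess ethylene, the value can be converted to an equilibrium constant with K = exp(-ΔG/RT). The sketch below assumes 298.15 K as the standard temperature (the conditions behind the estimate are given only in the supplementary materials) and treats the quoted value as ΔG per propylene-forming step; the function name is illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def equilibrium_constant(delta_g_kcal_per_mol: float,
                         temperature_k: float = 298.15) -> float:
    """Return K = exp(-dG / RT) for a free energy given in kcal/mol."""
    return math.exp(-delta_g_kcal_per_mol / (R_KCAL * temperature_k))


if __name__ == "__main__":
    # Assumed standard temperature of 298.15 K.
    k_298 = equilibrium_constant(-1.52)
    # 60 C, under the further assumption that dG is unchanged with temperature.
    k_333 = equilibrium_constant(-1.52, 333.15)
    print(f"K(298 K) = {k_298:.1f}, K(333 K) = {k_333:.1f}")
```

Under these assumptions K is roughly 13 per step at 298 K, which is favorable but modest, so a large excess of ethylene helps drive the chain to full consumption.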

Finally, we tested the formation of propylene
from ethylene and the unsaturated PE samples
that we produced by dehydrogenation. The reactions
of ethylene with the material from
HDPE catalyzed by the two complexes in our
I/E system formed propylene in yields shown
in Fig. 3B. Reactions with HDPE containing
only 0.68% unsaturation formed propylene in
nearly 60% yield [turnover numbers (TONs)
of 34 and 55 for M1 and I1, respectively].
Reaction of the polymer in which 1.9% of the
monomer units were unsaturated led to the
formation of propylene in 87% yield (TONs of
48 and 79 for M1 and I1, respectively). The
conversion of unsaturated LDPE to propylene also
occurred under these conditions in yields up to
70% (TONs of 39 and 64 for M1 and I1, respectively).
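
The turnover numbers quoted above can be checked from the yields and the catalyst loadings. The sketch assumes that 100% yield corresponds to two propylene molecules per PE monomer (C2H4) unit and uses the loadings given in the Fig. 3 conditions (3.6 mol % M1 and 2.2 mol % I1 per monomer unit). The two-per-unit basis is an inference that reproduces the reported values, not a definition stated in the text.

```python
def turnover_number(yield_fraction: float, loading_mol_fraction: float,
                    propylene_per_unit: float = 2.0) -> float:
    """Moles of propylene formed per mole of catalyst.

    Yield and loading are both expressed per PE monomer (C2H4) unit.
    propylene_per_unit = 2 is an assumed stoichiometric basis.
    """
    return yield_fraction * propylene_per_unit / loading_mol_fraction


# Catalyst loadings as mole fractions relative to PE monomer units.
LOADINGS = {"M1": 0.036, "I1": 0.022}

if __name__ == "__main__":
    for label, y in [("HDPE, 1.9% unsaturation", 0.87), ("LDPE", 0.70)]:
        tons = {cat: round(turnover_number(y, load))
                for cat, load in LOADINGS.items()}
        print(label, tons)
```

With these inputs the 87% case gives TONs of 48 (M1) and 79 (I1), and the 70% case gives 39 and 64, matching the values in the paragraph.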

We considered that ethylene could undergo 
dimerization or oligomerization, followed by 
ethenolysis, to form propylene without contri- 
bution from the polymer chains. Indeed, we 
did observe this background reactivity in the 
absence of long-chain olefins to form 0.13 mmol 
of propylene under our reaction conditions 
after 16 hours at 60°C and to form 1.01 mmol 
of propylene after the same reaction time at 
130°C. Thus, to determine whether the observed
propylene from the reactions with PE
originated from dehydrogenated polymer or
from the background conversion of ethylene
to propylene, we designed an isotope-labeling
experiment in which the production of propylene
from the polymer could be observed more directly.
Because both dehydrogenation and I/E scramble
2H labels, we conducted experiments with
13C-labeled PE to assess clearly the extent of
formation of propylene from polymer.
The propylene arising from dehydrogenated
PE by the proposed scheme would consist
entirely of the singly 13C-labeled isotopomer.
This isotopic composition would be observed
because scrambling would occur at the termini
of propylene by olefin metathesis during the
I/E, but scrambling at the 2 position would
not (Fig. 3C). Accordingly, we subjected
13C-labeled (99.9% 13C) HDPE to DIE and
characterized the resulting propylene by mass
spectrometry. The mass spectrum revealed an
isotopic enrichment of the propylene of >70%,
demonstrating that most of the propylene
observed from the DIE of PE results from
polymer chain degradation. Small amounts of
propylene containing two 13C carbons, which result
from metathesis of the initial propylene product
with the 13C-ethene formed in situ, were also
detected, but this material accounted for just
1% of the propylene product.

We also conducted DIE on samples of post- 
consumer PE (Fig. 3E). Substantial yields of 
propylene were observed in all cases. More- 
over, HDPE from a milk jug yielded 57% pro- 
pylene, which is only 20% lower than the 
yield from virgin HDPE with a similar degree 
of unsaturation, and LDPE food packaging 
generated >50% yield of propylene. To inves- 
tigate the effect of additives on the perfor- 
mance of the catalytic system, these experiments 
were conducted by two methods. In the first 
method, postconsumer samples were coarsely 
cut with scissors and dehydrogenated directly 
using Ir-tBuPOCOP under the conditions
specified in Fig. 2A. In the second method, the
cut samples were first dissolved in boiling 
xylenes and then precipitated from acetone 
to remove additives and impurities. In some 
cases, the additives and impurities had no 
effect, and in some cases, such as those from 
can tops and food packaging, the simple pre- 
cipitation was needed for dehydrogenation to 
occur. These results show that postconsumer 
products comprising HDPE or LDPE can be 
converted to propylene by this protocol, and 
that our system is sufficiently robust to tolerate 
the additives present in postconsumer products 
while still furnishing substantial amounts of 
propylene. 

Fig. 3. Development of conditions for DIE of PE. (A) Application of I/E to
dehydro-PE, with dehydro-HDPE yielding a maximum of 80% propylene with
1.96% dehydrogenation. Conditions for I/E of dehydrogenated PE: 3.6% M1,
2.2% I1, 1.78 M in PE, heated at 130°C in p-xylene for 16 hours under 25 bar of
ethylene. Catalyst loadings are reported as mol % relative to total PE repeat units.
(B) DIE of 13C-labeled HDPE to check for background reactivity [conditions identical to
those in (A)]. (C) DIE of postconsumer PE [conditions identical to those in (B)]. Propylene
yields are reported as X% (Y), where X is the yield obtained from polymer purified by
precipitation from acetone and Y is the yield obtained from unmodified polymer.
Where a single yield is reported, dehydrogenation on unpurified samples failed.
[Panel (C) values recovered from the figure: milk bottle (HDPE), 1.41% olefin, 56.8% (46.6) yield;
shampoo bottle (HDPE), 1.34% olefin, 43.3% (39.5) yield; packaging (LDPE), 1.46% olefin, 50.4% yield.]

The time courses of reactions for degrading
octadecene at 60°C and degrading HDPE at
130°C are displayed in Fig. 4, A and B, respectively.
These data revealed a lengthy induction
period for the reaction of octadecene at 60°C,
presumably because of the multiple stages of
catalyst activation necessary for the reaction to
occur at an appreciable rate. This plot fit well
to an exponential formation of catalyst and a
zero-order appearance of product until the
substrate was depleted. The reaction of the
unsaturated HDPE at 130°C occurred with a shorter
induction period despite the need to first cleave
the internal alkenes of the unsaturated HDPE,
and a constant rate was then observed up to
85% of the total conversion. Presumably, the
higher temperature diminishes the induction
period. The linear formation of propylene
suggests that the process occurs by a
rate-determining reaction with the excess ethylene.

Additional data that we gathered on the
degradation of PE by DIE point to future
challenges. First, samples of dehydrogenated LDPE
consistently furnished propylene in lower yields
than did dehydrogenated HDPE with the same
degree of unsaturation. We hypothesized that
the significant degree of branching in LDPE
led to the formation of trisubstituted alkenes
during I/E, and these more hindered alkenes
underwent slower isomerization or slower
ethenolysis. To test this hypothesis, we
synthesized several substituted olefins and subjected
them to I/E under conditions identical to
those we used for the I/E of 1-octadecene (Fig.
4C). The I/E of substrates 1a to 1d demonstrates
a clear trend of decreasing propylene yield with
increasing olefin substitution.

Second, our experiments with virgin PE
(Fig. 3B) showed that a high degree of
unsaturation of PE chains was detrimental to
the yield of I/E. We hypothesized from this
observation that a high degree of polymer
unsaturation leads to dienes and polyenes
that isomerize during the I/E process to
conjugated dienes that could poison the catalyst.
Because there are multiple sites of unsaturation
on each polymer chain, I1 could isomerize such
olefins to conjugated 1,3-dienes within the
polymer, and such conjugated dienes could
poison one or both catalysts. To test this
hypothesis, we conducted I/E on one α,ω-diene,
1e (1,9-decadiene), and one conjugated diene,
1f (1,3-tetradecadiene) (Fig. 4D). These
experiments showed that α,ω-dienes separated by six
methylene units (1e) formed propylene in
much higher yields than did 1,3-dienes.
Consistent with our hypothesis, the addition of
catalytic amounts of catalyst I1 to conjugated
diene 1f without ethylene or the metathesis
catalyst led to the immediate formation of a
black precipitate (Fig. 4D).

Fig. 4. Investigation of reaction scope and kinetics. (A) Time course of
I/E with 1-octadecene (100% yield = 458.3 mbar). Conditions for I/E of
1-octadecene: 6.0 mol % M1 (0.67 mol % relative to ethylene subunits), 3.0 mol %
I1 (0.33 mol % relative to ethylene subunits), 0.500 M in olefin, heated at 60°C in
p-xylene for 16 hours under 25 bar of ethylene. A fit for the time course was
conducted using COPASI by simulating an exponential appearance of catalyst
and a zero-order reaction (R² = 0.993, kcat = 0.071 s⁻¹, kpropylene = 18.0 s⁻¹).
(B) Time course of I/E with dehydrogenated HDPE (1.41% olefin) (100%
yield = 193.5 mbar). A linear fit has been added to (B) to show the zero-order
reaction after the short induction period and completion of the reaction (R² =
0.989). (C) Yields of I/E on small molecules 1a to 1d designed to assess the
effect of branching on I/E. (D) Yields of I/E on small molecules 1e and 1f
designed to test the effect of dienes on I/E [conditions identical to those
specified in (A)] and on PCO (molecular weight = 5.2 kDa) designed to demonstrate the effect of
polymer unsaturation on I/E yield in the absence of branching. Conditions for
I/E in (C) and (D) were identical to those in Fig. 3C.

We assessed this hypothesis further by 
applying I/E to poly(cyclooctene) (PCO). We 
synthesized PCO (molecular weight = 5.2 kDa, Đ = 1.64) by
ring-opening metathesis polymerization of
cyclooctene (27). I/E of this substrate fur- 
nished propylene in significantly diminished 
yield, but I/E of a partially hydrogenated PCO 
(28) (with an olefin content of 12.2% versus 
25% for the original polymer) furnished pro- 
pylene in 38.5% yield (Fig. 4D). We concluded 
from these data that the formation of 1,3- 
dienes in situ was likely responsible for the 


lower yields of propylene from the more highly 
unsaturated polymers. 

Propylene is used as a precursor to multiple 
commodity chemicals, including acrylic acid, 
acrylonitrile, and cumene, in addition to its 
use in the production of polypropylene plastic 
(28). Although it is produced primarily as a by- 
product of steam cracking or fluid catalytic 
cracking, the increasing demand for propylene
since the 1990s has motivated its “on-purpose”
production through methanol-to-olefins tech- 
nology, propane dehydrogenation, or the 
conproportionation of butene and ethylene 
(29). This on-purpose production is only ex- 
pected to increase as increasing populations 
and standards of living heighten the demand 
for polypropylene. 

The constituent reactions of the DIE process 
that we report on laboratory scale with homo- 
geneous catalysts are closely related to reac- 
tions conducted on industrial scales with 
heterogeneous catalysts, and their combi- 
nation draws analogies to the combination 
of reactions in the commodity-scale Shell
higher olefin process (SHOP) that forms long- 
chain alkenes from ethylene (29). The olefin 
metathesis and isomerization by which this 
degradation of the unsaturated polymer oc- 
curs are some of the most practiced industrial 
transformations and are routinely performed 
on commodity scales with heterogeneous cat- 
alysts (30, 31). Thus, the capacity of such 
systems to catalyze the DIE processes would 
allow dehydrogenation and I/E to be conducted 
in continuous flow with less ethylene while 
removing the propylene product. The sepa- 
ration of propylene from ethylene is already 
conducted on the industrial scale by various 
methods, including low-temperature and 
extractive distillation (32). Thus, this ini- 
tial report charts a path by which a DIE 
sequence could render polyolefin waste a 
hydrocarbon feedstock to generate a commodity 
chemical for which the demand is comparable 
to the available supply of recovered polyolefins. 
The use of heterogeneous catalysts is under 
development.

METHANE EMISSIONS

Inefficient and unlit natural gas flares both emit large quantities of methane

Genevieve Plant, Eric A. Kort, Adam R. Brandt, Yuanlei Chen, Graham Fordice,
Alan M. Gorchov Negron, Stefan Schwietzke, Mackenzie Smith, Daniel Zavala-Araiza


Flaring is widely used by the fossil fuel industry to dispose of natural gas. Industry and governments 
generally assume that flares remain lit and destroy methane, the predominant component of natural gas, 
with 98% efficiency. Neither assumption, however, is based on real-world observations. We calculate flare 
efficiency using airborne sampling across three basins responsible for >80% of US flaring and combine 
these observations with unlit flare prevalence surveys. We find that both unlit flares and inefficient 
combustion contribute comparably to ineffective methane destruction, with flares effectively destroying 
only 91.1% (90.2, 91.8; 95% confidence interval) of methane. This represents a fivefold increase in 
methane emissions above present assumptions and constitutes 4 to 10% of total US oil and gas methane 
emissions, highlighting a previously underappreciated methane source and mitigation opportunity. 


In oil and gas (O&G) production and processing,
natural gas, most commonly associated
natural gas (a by-product of oil extraction),
is disposed of through flares
for a variety of safety, infrastructure,
regulatory, or economic reasons (1, 2). Ideally, the
combustion in a flare serves to convert the
mostly hydrocarbon gas to carbon dioxide
(CO2) and water (1). The primary component
of natural gas is typically methane (CH4), the
most important short-lived greenhouse gas
owing to its high radiative forcing and global
warming potential (3), relatively short
atmospheric lifetime of 9 to 10 years (4, 5), and
contribution to formation of tropospheric
ozone (6, 7). The use of flares aims to reduce the
climate and health impacts of the disposal of
this waste gas. Global flaring activity is captured
in the Visible Infrared Imaging Radiometer
Suite (VIIRS) Nightfire dataset, where
satellite-based nighttime imagery in five spectral bands
is used to calculate the temperature, radiant
heat, and source area of individual flares
(8-10). From these measurements, flare gas
volumes are inferred (11-13). Using these data,
the World Bank estimates that 142 billion cubic
meters (bcm) of gas was flared globally in
2020, with the United States having flared
11.8 bcm (14). The US consistently ranks among
the top five flaring nations in the world (14),
with >80% of gas volumes flared in the US
occurring within three regions: the Permian
Basin, the Bakken Formation in the Williston
Basin, and the Eagle Ford Shale in the Western
Gulf Basin (hereafter referred to as the Permian,
the Bakken, and the Eagle Ford; Fig. 1A). These
three regions, in aggregate, drive national
flaring trends (Fig. 1B) and are the primary
observational focus of measurements presented
herein, collected as part of the Flaring & Fossil
Fuels: Uncovering Emissions & Losses (F³UEL)
project (Fig. 1).

Fig. 1. Flaring activity in the United States. (A) Flaring locations in the
contiguous US as seen from space (VIIRS, red triangles) for 2020 (12) and airborne
sampling locations (F³UEL, black circles), investigating the regions of the
Bakken (yellow), Eagle Ford (green), and Permian (purple) from 25 August 2020
through 4 September 2020 and 28 June 2021 through 15 July 2021. (B) Temporal
evolution of total gas volumes flared (bcm) in the VIIRS data product in the
Bakken, Eagle Ford, and Permian, as delineated by the shaded regions in (A) (13).
In 2020, VIIRS observed 665, 547, and 1710 flares in the Bakken, Eagle Ford,
and Permian, respectively. The three areas in aggregate are the focus of this work
(F³UEL regions) and represent 80 to 90% of flared volumes in the US. At the
national scale, flare volumes represent ~1% of natural gas gross withdrawals and
generally track annual trends (fig. S1).

In 2018, Alvarez et al. presented a synthesis
study considering field measurements made
over the previous decade. They concluded that
considerable upward revisions of CH4 emissions
estimates were needed for a variety of
segments within the O&G sector (15). However,
given a lack of real-world observations,
estimates of flaring emissions were not
reevaluated. Estimates of flare-related emissions
are generally based on the dual assumptions
that flares operate continuously and that 98%
of the CH4 in the flare gas is destroyed through
combustion (16), a value based on limited US
Environmental Protection Agency controlled
studies conducted in the 1980s (17, 18). Neither
assumption is supported by measurements of
real-world flare operations.

Fig. 2. Airborne flare sampling. (A) Flight track from the Bakken on 29 June 2021 colored
by the CO2 signal [parts per million (ppm)]. The white squares indicate the CO2 peaks
attributed to the upwind flare, with the wind direction (measured by the aircraft) indicated by
the white arrow. The red star indicates the approximate location of the flare. (B) The
corresponding time series of CO2 (ppm) and CH4 (ppm), with the red squares indicating the
location of the combustion plume intercepts identified by the CO2 peak-finding algorithm
(further details in the materials and methods).

Table 1. Basin-level flare effective CH4 destruction removal efficiency (DRE). Columns: Region;
Observed flare DRE, mean (%) and 95% CI; Unlit flares (%); Total effective DRE*, mean (%) and 95% CI.
[Only part of the table body is recoverable. Eagle Ford (row assignment uncertain): observed 95% CI
95.4, 97.4; total effective DRE 92.4 (91.3, 93.3). Average: observed 95% CI 94.3, 95.9; total
effective 95% CI 90.2, 91.8.]
*Combines observed unlit flare statistics and DRE of lit flares. †Average of unlit flare rate
observed in the Bakken (this work) and Permian surveys [Lyon et al. (33)].

Although VIIRS provides global flare volume
information for flares large enough to
be viewed from space, this product does not
evaluate flare performance. The operation of
individual flares in response to specific
environmental and gas composition variables
has been evaluated in laboratory and testing
facility studies (19-25); however, only a handful
of in situ measurements to characterize
performance (i.e., efficiency) have been made of
real-world flares (26-30). A smaller subset of
these studies estimate flaring efficiency in terms
of CH4 removal. An early work based on remote


sensing of three flares found combustion effi- 
ciencies of >99% on average (28), whereas more 
recent airborne-based studies in the US have 
found mixed results. Caulton et al. sampled 
10 flares in North Dakota and Pennsylvania, 
with a sampling bias toward the largest and 
brightest flares, and found that all were >99% 
efficient (29). Gvakharia et al. measured 37 flares 
in the Bakken and found that their efficiencies 
followed a skewed distribution with a median 
efficiency of 97% (mean: 95.2%), which, if rep- 
resentative of flare performance for the entire 
basin, would more than double the CH4 flare
emissions from the region (30). The limited 
statistics of these previous studies undermines 
the ability to extrapolate to larger scales, and, 
as such, this emission source has not received 
much attention as a methane emissions miti- 
gation opportunity. 

In addition to the efficiency of combusting
flares, unlit flares (i.e., those that directly vent
hydrocarbons to the atmosphere as a result of
the flame extinguishing or never being lit)
must also be considered. Data about the
incidence rates of unlit flares remain incomplete.
To date, the largest evaluation of unlit flares
in the US is available through the Permian
Methane Analysis Project (PermianMAP), an
endeavor by the Environmental Defense Fund
to quantify and locate CH4 emissions in the
Permian (31, 32). Lyon et al. found that, on
average, 4.9% of observed flares were unlit
(33). It is not known, however, whether these
statistics are representative of other regions
and basins, particularly as the Permian has
experienced unprecedented growth that has
contributed to markedly high methane
emissions (34). Also in the Permian, Cusworth et al.
found that plumes from active and inactive
flares accounted for 12.1% of the detected
plume emissions captured in their aerial
imaging study (35). In Canada, an airborne survey
reported by Tyner and Johnson found that
unlit flares accounted for 13% of the total CH4
emissions quantified in their 2019 survey in
northern British Columbia (36). These surveys
highlight the important role unlit flares play
in CH4 emissions from the O&G sector; however,
we do not presently know the relative
importance to methane emissions of unlit
flares compared with inefficient combustion.
Knowledge of the full climate impact of flaring
could help determine efficient and effective
mitigation efforts, guiding efforts to address
issues with combustion efficiency as well as
with unlit flare prevalence.

As part of the F³UEL project, we quantified
the performance of flares across three regions
responsible for the bulk of flaring in the
US, namely the Permian, Eagle Ford, and
Bakken (Fig. 1A). Using an aircraft-based
sampling approach over 12 research flights in
2020 and 2021, we directly measured methane,
carbon dioxide, nitric oxide (NO), and nitrogen
dioxide (NO2) in flare combustion plumes, from
which we quantified flare performance through
calculation of the CH4 destruction removal
efficiency (DRE) for each plume intercept.
The assessment of nitrogen oxides, or NOx
(NO + NO2), emissions and air quality
implications will be the subject of forthcoming
work. Consistent with previous airborne flare
sampling campaigns (29, 30), this method 
allows for direct measurement of the relative 
enhancements of the flare combustion plume 
without ground access to the flaring facility. 
An example flare-sampling flight track is shown 
in Fig. 2A, along with the accompanying con- 
centration time series in Fig. 2B (details giv- 
en in the materials and methods). With the 
airborne platform, we sampled more than 
600 intercepts of flare combustion plumes, 
representing >300 distinct flares across the 
three basins (table S1). This represents an 
order-of-magnitude increase in the amount 
of data on in situ measurements of real-world 
flare performance. 

Coincident measurements of CO2 and CH4
in the flare combustion plume allow for the
calculation of CH4 DRE, characterizing how
well the flare's combustion destroys CH4. DRE
is distinct from combustion efficiency (CE):
DRE is the percentage of a specific species (e.g.,
CH4) converted to another (e.g., CO2), whereas
CE is the percentage of total hydrocarbons in
the gas that is converted to CO2 (16). Following
the methodology previously demonstrated
in the literature (29, 30), we estimate the DRE
of CH4 for each individual intercept of a flare
combustion plume (materials and methods).
The distribution of CH4 DRE (%) values for
flare intercepts across the three basins of study
is shown in Fig. 3. In reporting and inventory
estimates, 98% is a common default value used
to quantify the efficiency at which flares convert
CH4 in the fuel gas to CO2 (2), and this is shown
in Fig. 3 as the dashed vertical line. Similar
distributions result when flare intercepts are grouped
and attributed to individual flares (fig. S2).

For all basins, the observed DRE values exhibit
a skewed distribution, such that the
median observed values are close to 98%; however,
the effective efficiencies are much lower,
with substantial contributions from flares
whose DRE can drop as low as 60%. Lognormal 
distributions, which capture the heavy tail, 
fit the observed DREs far better than normal 
distributions (Fig. 3; fit parameters in table S3). 
This trend is consistent with results from a 
previous study in the Bakken that observed a 
similarly skewed DRE distribution among 
37 flares (30). Investigations into possible
drivers of reduced DRE, such as wind speed
(measured at the aircraft), flare volume and
temperature (VIIRS), and estimated well age
and gas/oil ratio (37) did not yield compelling
explanatory relationships, suggesting that the
combination of our airborne sampling and
these supplemental datasets cannot explain
most of the observed flare CH4 DRE variability.
Improving attribution to flare design,
operation, and environmental conditions would
require a different study strategy, likely with
more information on individual flare
infrastructure and operation.

Fig. 3. Performance of active flares. Observed CH4 DRE distributions of individual intercepts of lit flares for
the (A) Bakken, (B) Eagle Ford, and (C) Permian. Normal (red dotted line) and lognormal (blue solid line)
fits to the distributions are shown as normalized to the mean and maximum counts, respectively, to overlap with
the observed data.


Unlit flares directly venting unburned gas to 
the atmosphere as a result of the flame being 
extinguished or never properly ignited have an 
additional impact on the flaring CH4 budget.
Across three infrared optical gas imaging sur- 
veys (onboard a helicopter platform), Lyon et al. 
found that, on average, 4.9% of surveyed flares 
in the Permian were unlit and venting (33). In 
2021, we sampled 601 active flares in the Bakken 
as part of a ground-based infrared imaging sur- 
vey and found that 3.2% of active flares were 
unlit (materials and methods and table S6). 
Owing to a lack of unlit flare observations in
the Eagle Ford, in this study we assume the
percentage of unlit flares is 4.1%, the average
of Bakken and Permian fractions (sensitivity
study in materials and methods).

To investigate CH4 emissions from flares at
the basin scale, we combined contributions of 
both inefficient performance and the preva- 
lence of unlit flares into a total effective DRE 
(Table 1). We expanded our observed DRE 
estimates to the basin level by randomly re- 
sampling (with replacement) the observed 
DRE distributions and applying those efficien- 
cies to the population of flares seen in VIIRS 
within each basin (delineated by the yellow, 
green, and purple regions shown in Fig. 1A), 
the results of which are shown in Table 1. We 
used a randomized approach, as we detected 
no relationship between observed DRE and 
flare size or temperature as quantified in the 
VIIRS product (see fig. S3). Additionally, we 
found that sampled flares represent the range 
of flare temperatures and sizes as seen by VIIRS 
(see fig. S4). From a bootstrap resampling (with 
replacement) of our observations, we inferred 
the uncertainty of our basin-average estimates, 
deriving 95% confidence intervals (CI). Unlit 
flares are assumed to have an efficiency of 0% 
(i.e., no combustion) and therefore directly 
reduce the effective DRE at the basin level by 
the unlit flare fraction (4.9%, Permian; 3.2%, 
Bakken; 4.1%, Eagle Ford). 
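
The resampling and the unlit-flare adjustment described above can be sketched as follows. The DRE sample is hypothetical placeholder data, and the adjustment treats a basin as a weighted mix of lit flares and unlit flares at 0% efficiency. The authors' exact weighting over the VIIRS flare population is described in their materials and methods and is not reproduced here, so this sketch will not match Table 1 exactly.

```python
import random
import statistics


def effective_dre(lit_dre_percent: float, unlit_fraction: float) -> float:
    """Weighted mean of lit flares (at lit_dre_percent) and unlit flares (at 0%)."""
    return (1.0 - unlit_fraction) * lit_dre_percent


def bootstrap_mean_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean.

    Each resample draws len(values) observations with replacement.
    """
    rng = random.Random(seed)
    means = sorted(
        statistics.fmean(rng.choices(values, k=len(values)))
        for _ in range(n_resamples)
    )
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return statistics.fmean(values), lower, upper


if __name__ == "__main__":
    # Hypothetical, skewed DRE observations (%); NOT data from the study.
    sample = [99.5, 99.1, 98.7, 98.4, 98.0, 97.6, 96.9, 95.0, 88.0, 72.0]
    mean, lower, upper = bootstrap_mean_ci(sample)
    print(f"lit-flare mean DRE {mean:.1f}% (95% CI {lower:.1f}, {upper:.1f})")
    print(f"effective DRE with 4.1% unlit: {effective_dre(mean, 0.041):.1f}%")
```

The heavy lower tail in the placeholder sample pulls the mean well below the median, which is the same qualitative effect the text describes for the observed distributions.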

The inefficiency of combusting flares in 
combination with the rate of unlit flares re- 
sults in effective flare efficiencies that are 
considerably lower than 98% across all three 
basins. The standard value of 98% falls outside 
the confidence intervals for our basin-average 
DRE for lit flares. This indicates that flaring 
activities are a much larger part of the O&G
CH4 footprint than previously estimated. In
addition, the relative contributions of
poorly combusting and unlit flares to total
CH4 flaring emissions are similar.

The average observed DRE across the study 
is 95.2%, and the average total effective DRE 
after accounting for unlit flares is 91.1%. Using 
flare volumes estimated from VIIRS for these 
regions (3.8, 1.6, and 5.2 bcm for the Bakken, 
Eagle Ford, and Permian, respectively, for 2020) 
(12, 13), along with the basin-specific unlit flare 
fraction and average gas composition from the 
Greenhouse Gas Reporting Program (see fig. 
S5), we estimate a flaring CH4 emissions rate 
from lit flares in these three basins of 0.27 
(0.14, 0.38; 95% CI) Tg CH4/year. Accounting 
for the presence of unlit flares increases that 
estimate to 0.49 (0.26, 0.72; 95% CI) Tg CH4/year 
for these high-flaring areas. These emissions 
estimates are considerably larger (~5x) 
than if we assume 98% DRE for all flares 
quantified by VIIRS and no occurrences of unlit 
flares (0.10 Tg CH4/year). 
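
As a rough order-of-magnitude check on these figures, the conversion from flared volume to escaped CH4 can be sketched as follows. The flare volumes and the two efficiencies come from the text; the CH4 density and the uniform 80% CH4 fraction are assumptions made here for illustration (the study uses basin-specific gas compositions), and the function name is hypothetical. The sketch is not expected to reproduce the reported values exactly.

```python
# ASSUMPTION: approximate CH4 density near standard conditions (kg per cubic meter).
CH4_DENSITY_KG_PER_M3 = 0.68
# ASSUMPTION: uniform CH4 fraction; the study uses basin-specific compositions.
CH4_FRACTION = 0.80

# Flared gas volumes for 2020 in billion cubic meters (bcm), from the text.
flare_volume_bcm = {"Bakken": 3.8, "Eagle Ford": 1.6, "Permian": 5.2}

def ch4_emissions_tg(volume_bcm, dre, ch4_fraction=CH4_FRACTION):
    """CH4 escaping combustion (Tg/year) for a flared volume and a DRE fraction."""
    ch4_kg = volume_bcm * 1e9 * ch4_fraction * CH4_DENSITY_KG_PER_M3
    return ch4_kg * (1.0 - dre) / 1e9  # 1 Tg = 1e9 kg

total_bcm = sum(flare_volume_bcm.values())
baseline = ch4_emissions_tg(total_bcm, 0.98)   # standard 98% assumption
observed = ch4_emissions_tg(total_bcm, 0.911)  # study-average effective DRE

print(f"98% DRE:   {baseline:.2f} Tg CH4/year")
print(f"91.1% DRE: {observed:.2f} Tg CH4/year")
print(f"ratio:     {observed / baseline:.1f}x")
```

Under these assumptions the two cases come out near 0.12 and 0.51 Tg CH4/year, a ratio of about 4.5, which is in line with the roughly fivefold difference reported above.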

Flaring in the Bakken, Eagle Ford, and 
Permian represents the vast majority of 
activity in the US, ~84% (12, 13). If we assume 
that the average effective DRE and a gas 
composition of 80% CH4 are representative of 
the remaining flares, US CH4 flaring emissions 
total 0.60 (0.31, 0.87) Tg CH4/year, again five 
times larger than current assumptions would 
suggest (0.12 Tg CH4/year), and a total 
emission magnitude comparable with notable O&G 
areas such as the Four Corners region (Colorado 
and New Mexico, US) at 0.54 Tg CH4/year 
(± 0.20; 1σ) (38). The difference in US flare 
CH4 emissions based on the observed effective 
DRE and 98%, 0.48 Tg CH4/year, represents a 
substantial underaccounting of CH4 emissions 
from flares. If measures were taken to 
ensure that US flares operated at 98% 
efficiency and remained lit, as current 
accounting assumes, this would be equivalent to 
removing 2.9 million cars from the road each 
year these mitigation measures were in place. 
This calculation uses a 100-year CH4 global 
warming potential (GWP) of 28 and an annual 
emission of a typical passenger vehicle of 
4.6 metric tons of CO2 (39). Using a 20-year 
GWP of 84 (40) increases this mitigation 
measure’s impact to the equivalent of removing 
8.8 million cars. Our estimate for flaring 
emissions in the US represents 4 to 10% of total 
CH4 emissions inventoried by the US 
Environmental Protection Agency for the O&G 
sector [8.4 Tg CH4/year for 2020 in table 3-2 
of (41)]. 
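
The car-equivalence and inventory-share arithmetic in this paragraph can be reproduced directly from the numbers it quotes. The sketch below uses only values stated in the text; the function and variable names are illustrative.

```python
EXCESS_CH4_TG = 0.48         # Tg CH4/year: observed effective DRE vs. 98% (from the text)
CAR_TONNES_CO2_PER_YEAR = 4.6  # metric tons CO2 per typical passenger vehicle (from the text)
EPA_OG_CH4_TG = 8.4          # Tg CH4/year: EPA O&G inventory for 2020 (from the text)

def cars_equivalent(ch4_tg, gwp):
    """Number of passenger vehicles whose annual CO2 equals the CO2-equivalent of ch4_tg."""
    co2e_tonnes = ch4_tg * 1e6 * gwp  # 1 Tg = 1e6 metric tons
    return co2e_tonnes / CAR_TONNES_CO2_PER_YEAR

cars_100yr = cars_equivalent(EXCESS_CH4_TG, gwp=28)  # 100-year GWP
cars_20yr = cars_equivalent(EXCESS_CH4_TG, gwp=84)   # 20-year GWP

# US flaring estimate and its 95% CI bounds as a share of the EPA inventory.
shares = [100 * x / EPA_OG_CH4_TG for x in (0.31, 0.60, 0.87)]

print(f"{cars_100yr / 1e6:.1f} million cars (GWP 28)")
print(f"{cars_20yr / 1e6:.1f} million cars (GWP 84)")
print("share of EPA O&G inventory (%):", [round(s) for s in shares])
```

The 4 to 10% range quoted above corresponds to the lower and upper confidence bounds (0.31 and 0.87 Tg CH4/year) divided by the 8.4 Tg CH4/year inventory total.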

We do not know whether these US-based ob- 
servations are globally representative. Flaring 
in other regions with differing regulations and 
oversight may perform better or worse. Never- 
theless, our observationally derived perform- 
ance characteristics are an improvement over 
simplified assumptions of 98% with no basis in 
real-world observations. Recently, the Interna- 
tional Energy Agency (IEA) revised their esti- 
mate of global methane emission from flares 
to the assumption that flares operate at 92% 
efficiency, although the genesis of this value 
remains opaque (42). This assumed efficiency 
is similar to our US average effective CH4 DRE 
of 91.1%. Using this updated efficiency 
assumption, the IEA estimated that flares were 
responsible for releasing 8 Tg (8 million tons) 
of CH4 in 2020 (42), which would be on par 
with the world’s methane ultra-emitters (43), 
equaling 8 to 11% of total global oil and gas 
emissions (44). This highlights the need for more 
measurements of flare performance around the 
globe and represents a heretofore unrealized— 
or at least greatly underestimated—large source 
of methane emissions with known mitigation 
options. In the US, we find that both inefficient 
combustion and unlit flares contribute to sub- 
stantial methane emissions greatly exceeding 
standard estimates for flares. As a result, miti- 
gation efforts that address either combustion 
efficiency or unlit flares (such as operational 
practices on flare maintenance), in addition to 
reducing the usage of flares altogether (with 
alternatives such as reinjection or small-scale 
gas capture technology), would provide con- 
siderable methane emission benefits. Because 
flaring has long been treated as effective, there 
is little to no assessment of repair and main- 
tenance costs, particularly for the issue of unlit 
flares. In principle, addressing unlit flares 
presents a simple and cost-effective CH4 
mitigation opportunity. Investigating the 
causes of flare malfunction could further 
inform mitigation measures and their costs. 
INDIANA UNIVERSITY 
BLOOMINGTON 


TENURED/TENURE-TRACK FACULTY POSITION IN VIROLOGY 
Department of Biology 
https://biology.indiana.edu/index.html 


The Microbiology Program in the Indiana University Department of Biology 
(https://biology.indiana.edu/graduate/microbiology/index.html) invites 
applications for a tenured/tenure-track position in Virology at the level of 
Assistant or Associate Professor. We are particularly interested in scientists 
studying emerging viruses. We encourage applications from candidates focused 
on the following areas of research: structure-function analysis of viral proteins, 
cell and molecular biology of events in viral replication, cellular responses 
to virus infection, and experimental approaches to virus evolution. Scientists 
working on model viral systems that inform our understanding of the biology 
of emerging viruses will also be considered. Start dates of Fall 2023 or Fall 
2024 are possible. 


This position is part of a significant, continuing expansion in the life sciences 
at IU Bloomington and represents an exceptional opportunity to join a strong 
Microbiology Program comprised of both Virologists and Bacteriologists. 
There also exist opportunities for interactions with programs in Molecular 
and Cellular Biochemistry, Genome, Cell and Developmental Biology, 
Evolutionary Biology, and Medical Sciences. The selected candidate will be 
provided with a competitive startup package and salary and will have access to 
outstanding research resources, including state-of-the-art facilities for genomics 
and bioinformatics, light and electron microscopy, flow cytometry, protein 
analysis, analytical chemistry, biophysical instrumentation, and crystallography. 
Applicants must hold a doctoral degree and have relevant postdoctoral 
experience with a strong record of research accomplishments. Successful 
candidates will be expected to develop a vigorous externally funded research 
program, and to participate in teaching at the undergraduate and graduate levels. 


For more detailed application instructions and conditions, please see 
https://indiana.peopleadmin.com/postings/13816. Please address inquiries 
to Jennifer Tarter at jenjones@indiana.edu. Applications received by October 
31, 2022 will be assured of full consideration; however, the search will 
remain open until a suitable candidate is found. 




INDIANA UNIVERSITY 


BLOOMINGTON 


TENURED/TENURE TRACK FACULTY POSITION IN EVOLUTION 
Indiana University Bloomington 


The Evolution, Ecology, and Behavior program in the Department of Biology 
at Indiana University Bloomington (IUB) invites applications for the newly 
endowed ‘Olsen Chair in Evolutionary Biology’, an open-rank tenured/ 
tenure-track position at the Assistant, Associate, or Full Professor level. 
We welcome applicants who address fundamental evolutionary questions 
using integrative approaches and methods, in any system, including from 
organismal, genetic, and/or theoretical perspectives. Start date is flexible, 
from Aug 2023 to Aug 2024. 


Applicants must hold a Ph.D. and have postdoctoral experience in relevant 
fields, with a strong record of research accomplishments. Successful candidates 
will be expected to develop a vigorous externally funded research program, 
and to participate in teaching at the undergraduate and graduate levels. The 
Department of Biology is committed to recruiting and retaining diverse 
faculty, and we strongly encourage applications from members of historically 
marginalized groups. We also welcome candidates who have demonstrated a 
commitment to working with people from groups that are underrepresented in 
STEM, through teaching, mentoring, or administration. 


Apply online at https://indiana.peopleadmin.com/postings/13961, by 
November 1, 2022. Please address inquiries concerning the search to Jennifer 
Tarter at 812-856-3984 or jenjones@indiana.edu. 


The Evolution, Ecology, and Behavior graduate program at IUB is a strong, 
integrative, and collegial group, ranked in the top 10 US EEB programs in 
2022: https://www.usnews.com/best-graduate-schools/top-science-schools/ 
ecology-rankings. Bloomington is a culturally diverse, welcoming city with 
a vibrant arts and music scene. The city is located among the hills, lakes, and 
forested landscape of south-central Indiana, with ample opportunities for 
outdoor activities and recreation. For information about the Department of 
Biology and for other links to the campus and the Bloomington community, 
see: http://www.bio.indiana.edu. 


IT’S NOT 
JUST A JOB. 


A CALLING. 


Find your next job at ScienceCareers.org 


Whether you’re looking to get ahead, get into, or just plain get 
advice about careers in science, there’s no better or more trusted 
authority. Get the scoop, stay in the loop with Science Careers. 


Science Careers 
myIDP: 
A career plan customized 
for you, by you. 


Features in myIDP include: 

• Exercises to help you examine your 
skills, interests, and values. 

• A list of 20 scientific career paths 
with a prediction of which ones best 
fit your skills and interests. 


Visit the website and start planning today! 
myIDP.sciencecareers.org 


In partnership with: Burroughs Wellcome Fund; University of California, 
San Francisco; FASEB; UMass Medical School 


Science Careers 


THE UNIVERSITY of NORTH CAROLINA at CHAPEL HILL 


OPEN RANK FACULTY COMPUTATIONAL MEDICINE-PHARMACOLOGY-GENETICS 


Position summary: The Computational Medicine Program and the Departments of Pharmacology and 
Genetics at the University of North Carolina at Chapel Hill (UNC-CH) invite applicants for two 
tenure-track faculty positions in Computational Medicine. The Computational Medicine Program at UNC-CH is a 
recently established research initiative that seeks to integrate modern computational approaches, including 
mathematical modeling, machine learning, computational genomics and network analysis, with 
cutting-edge experimental techniques to address problems relevant to human health. While the area of study is 
flexible, an ideal candidate would apply methods from systems biology and/or genomics to investigate 
molecular processes dysregulated in human diseases, and to eventually develop predictive models of these 
disease states. The successful candidate will hold a position, and research laboratory space, within the 
Computational Medicine Program, and also hold a position within either the Department of Genetics or 
Department of Pharmacology. They will be expected to direct an independent research program supported by 
extramural funding, and participate in team science projects initiated through the Computational Medicine 
Program. The successful candidate will also participate in graduate student mentoring and teaching at the 
graduate level. Applicant reviews will begin October 15 and continue until the positions are filled. 


Minimum education and experience requirements: A Ph.D. or equivalent degree is required in Genetics, 
Pharmacology, Computational Biology, Computer Science, Statistics, Mathematics or related field. The 
successful candidates will have significant research experience applying computational approaches to 
problems in biomedical research. While postdoctoral experience is not required, 3-6 years of postdoctoral 
research is typical. Current assistant and associate professors are also welcome to apply. 


Preferred qualifications, competencies and experience: Strong training and experience in applying 
computational and quantitative approaches to problems in biomedical research. It is anticipated that some 
candidates may be entirely computational, while others will incorporate an experimental component into 
their research; both types are eligible and encouraged to apply. 


Apply Here: Pharmacology: https://unc.peopleadmin.com/postings/239152 
OR Genetics: https://unc.peopleadmin.com/postings/239062 

Upload your C.V. to the “curriculum vitae/resume” tab, upload your Cover Letter to the “cover letter” tab, 
upload your research statement (statement of ongoing and planned research) to the “research statement” tab, 
and attach 2-3 recent publications to the “other document 2” and “other document 3” tabs. 

The University of North Carolina at Chapel Hill is an equal opportunity and affirmative action employer. 
All qualified applicants will receive consideration for employment without regard to age, color, disability, 
gender, gender expression, gender identity, genetic information, national origin, race, religion, sex, sexual 
orientation, or status as a protected veteran. 


KU 
UNIVERSITY OF KANSAS 


DISTINGUISHED PROFESSOR, FULL PROFESSOR LEVEL 
University of Kansas School of Pharmacy 


Applications are invited for appointment as a tenured, Distinguished 
Professor at the Full Professor level in the Department of 
Pharmacology & Toxicology, School of Pharmacy at the University 
of Kansas. We are looking for an individual using a systems-based 
approach to understand mechanisms of disease and drug action, 
in order to advance disease prevention, diagnosis, and treatment 
for Alzheimer’s disease or other neurological or neuropsychiatric 
disorders. The successful candidate must hold a PhD, MD, or 
DVM, have an exceptional record of externally funded research 
and previous teaching experience at the undergraduate and/or 
graduate level. The person appointed to this position is expected 
to lead and participate in collaborative research projects. The 
position comes with state-supported salary for nine months and 
opportunity for grant-supported summer salary for an additional 
3 months. Excellent University-shared core research facilities 
that house specialized scientific instruments and provide expert 
consultation and services to investigators, a generous start-up 
package and endowed funds support the position. 


For more information and to apply, go to: 
https://employment.ku.edu/academic/23365BR 


Review of applications begins November 1, 2022 and will continue 
until the position is filled. 


The University of Kansas is an EO/AAE, full policy at 
http://policy.ku.edu/IOA/nondiscrimination. 


RUTGERS 


Faculty Tenure-track Assistant Professor 


The Department of Biological Sciences at Rutgers University-Newark invites 
applications for two tenure-track Assistant Professor positions in the broadly 
defined field of cellular and molecular biology. Applicants using biochemical, 
molecular, genetic, bioinformatics (OMICS) and/or cellular approaches to 
study problems in the areas of neurobiology, cancer biology, developmental 
biology, stem cell biology, immunology, systems biology, microbiology, or 
related areas are encouraged to apply. Qualifications include a PhD degree or 
equivalent in the field of cell or molecular biology or related disciplines, and 
at least two years of postdoctoral research experience. Applicants must have a 
strong record of research accomplishments as demonstrated by publications in 
leading peer-reviewed journals. The successful applicant will receive generous 
start-up support and laboratory space in anticipation of developing a robust, 
externally-funded, independent research program and contributing to the 
teaching mission of the department. 


Responsibilities: Candidates are expected to develop an extramurally funded 
active research program that will incorporate graduate and undergraduate 
students. Candidates must be dedicated to providing quality instruction 
and undertake independent scholarly activity appropriate for advancement 
within Rutgers University. Responsibilities also include departmental and 
university service with a willingness to take on administrative responsibility. 


Requirements: Qualifications include a PhD degree or equivalent in cell or 
molecular biology or related disciplines and at least two years of postdoctoral 
research experience. Applicants must have a strong record of research 
accomplishments as demonstrated by publications in leading peer-reviewed 
journals, demonstrating the ability to build a body of recognized work 
suitable for tenure at a research university. 


Review of applications will begin December 1, 2022, and continue until the 
positions are filled. 


Applications: Please visit https://jobs.rutgers.edu/postings/175954 
Rutgers is an Affirmative Action/Equal Opportunity Employer. 


WORKING LIFE 


By Naim Rashid 


Let’s talk about authorship 


As a Ph.D. student, I was excited when a friend proposed we collaborate on a project. I asked 
my supervisor for permission and he happily agreed, but advised that we should define our 
expected contributions and credit before starting. I didn’t heed his words. I wasn’t going to 
negotiate with a fast friend; I fully trusted my collaborator to give me fair credit, and I did 
not want to come across as entitled. Everything went fine. We completed and published the 
work as co-authors. Years passed, and I forgot my supervisor’s advice. 


As I moved into an assistant pro- 
fessor position, I knew publishing 
was crucial for my career. Initially I 
didn’t have the funding or projects 
to recruit my own graduate stu- 
dents, so I co-supervised students 
who were under the primary super- 
vision of other colleagues. I shared 
the intellectual burden and was a 
co-author on the resulting papers. 
My colleagues and I were quite 
satisfied with this win-win situa- 
tion, and everything seemed to be 
going smoothly—until I moved to 
another institution. 

Just a few months after leaving, 
I was browsing the literature and 
found a new article published by a 
student I had co-supervised. I was 
shocked and disappointed to see 
that I was not listed as an author or 
even acknowledged. I brought it up 
to the other supervisor, who was a close friend at the time. 
He told me they had just overlooked my contribution, but I 
doubted that. Plenty of other authors on the paper seemed 
to have contributed less to the work. We couldn’t come to 
a happy resolution. We never collaborated again, and our 
relationship was ruined. Years passed, and I nearly forgot 
this difficult experience, which I saw as an unfortunate one- 
time dispute. 

My career continued to progress through a number of 
countries and institutions, and at one point I was part of 
a team of investigators sharing funding and students. We 
agreed that any publication from our team would include 
the name of every team member. In hindsight I recognize 
this is not the best practice for ethical research and publish- 
ing, as some authors would get credit for minimal contri- 
butions, but we were not professionally mature enough to 
realize this. 

Over the following years we all moved on to different in- 
stitutions, and I noticed my colleagues no longer included 
me on papers about work that began during our time together. 
I took this to mean our agreement was concluded, 
and I stopped including them on 
my papers as well. So, I was sur- 
prised when I got a call from one of 
those colleagues, upset that I hadn’t 
included him as a co-author. I tried 
to explain, but he felt wronged. 

Remembering the time I had 
been on the other side of the coin, I 
was overwhelmed with regret and 
doubt. Who was wrong in each dis- 
pute, I or my colleagues? For days, 
I told myself that I had done no 
wrong in either case—until, while 
I was out for a morning riverside 
walk, my supervisor’s advice from 
years ago flashed to mind. “Naim— 
you can collaborate, but define 
your role and credit first.” 

I realized that in both cases, I 
had made a mistake. I should have 
initiated explicit conversations 
about authorship with my col- 
leagues from the outset, regardless of our friendships. And 
we should have had follow-up discussions along the way, 
including when individuals changed institutions. 

Thankfully, I was able to repair my relationship with 
my second colleague through further conversations. 
And since then, I have transformed my approach to col- 
laboration and authorship. Before embarking on a proj- 
ect, I make sure to have a proactive discussion to clarify 
each team member’s role and expected credit, acknowledg- 
ing that contributions may change over the course of the 
project and we may need to revisit our agreement. Some- 
times my colleagues seem offended, as if I am treating them 
not as a friendly partner, but as a cold professional, and I 
have lost several potential collaborations along the way. But 
painful experiences have shown me time and again that this 
is the best approach to foster long-term collaborations, en- 
sure everyone gets the credit they are due, and ultimately 
protect our personal and professional relationships. 


Naim Rashid is a researcher at Hamad Bin Khalifa University in Qatar. 
Send your career story to SciCareerEditor@aaas.org. 
Publish your research in the Science family of journals 


The Science family of journals (Science, Science Advances, Science Immunology, Science 
Robotics, Science Signaling, and Science Translational Medicine) are among the most highly- 
regarded journals in the world for quality and selectivity. Our peer-reviewed journals are 
committed to publishing cutting-edge research, incisive scientific commentary, and insights 
on what's important to the scientific world at the highest standards. 


Submit your research today! 
Learn more at Science.org/journals 